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Abstract 

We obtain new principles for transferring log-Sobolev and Spectral-Gap inequalities 
from a source metric-measure space to a target one, when the curvature of the target 
space is bounded from below. As our main application, we obtain explicit estimates for 
the log-Sobolev and Spectral-Gap constants of various conservative spin system models, 
consisting of non-interacting and weakly-interacting particles, constrained to conserve 
the mean-spin. When the self-interaction is a perturbation of a strongly convex po- 
tential, this partially recovers and partially extends previous results of Caputo, Chafa'i, 
Grunewald, Landim, Lu, Menz, Otto, Panizo, Villani, Westdickenberg and Yau. When 
the self-interaction is only assumed to be (non-strongly) convex, as in the case of the 
two-sided exponential measure, we obtain sharp estimates on the system's spectral-gap 
as a function of the mean-spin, independently of the size of the system. 

1 Introduction 

The log-Sobolev and spectral-gap (or Poincare) inequalities are among the most fundamen- 
tal and useful functional inequalities for the analysis of equilibrium and non-equilibrium 
statistical mechanical systems. For instance, in the context of n-particle spin-systems, it is 
known that under various typical conditions (see [52, 50, 7, 29]), the existence of a uniform 
lower bound on the spectral-gap or log-Sobolev constant associated to a spin-system, inde- 
pendent of boundary conditions and system-size, is equivalent to the exponential decay of 
spin-spin correlations in the distance between sites, and is thus a strong manifestation of 
the existence of a unique phase in the thermodynamic limit. However, for many natural 
non-trivial models, it is by no means an easy task to obtain uniform or other quantitative 
bounds on the spectral-gap or log-Sobolev constants. To elucidate this point, let us start 
by introducing our protagonists. 

Let (Cl,d,fi) denote a measure-metric space, meaning that (£l,d) is a separable metric 
space and \x is a Borel probability measure on (Q,d). Let T = J-(£l,d) denote the space 
of functions which are Lipschitz on every ball in (fl, d). Given / 6 J, define |V/| as the 
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following Borel function: 



|V/|(x) := hmsup — . 

d(y,x)^0+ «l x > V) 

(and we define it as if £ is an isolated point - see [6, pp. 184,189] for more details). In the 
smooth Euclidean setting, | V/| of course coincides with the Euclidean length of the gradient 
of /. The study of the log-Sobolev inequality was initiated in the works of Federbush [14] 
and Gross [18] (cf. Stam [49]), and is a key feature of the Gaussian measure (see e.g. [28] 
for a general introduction and applications): 

Definition, (fi, d, p) is said to satisfy a log-Sobolev (LS) inequality with constant p > 
(LSI(p)) if: 

V/G.F ^Ent M (/ 2 )< J \Vf\ 2 dp, (1.1) 
where Ent^(p) denotes the entropy of a non-negative function g: 

Ent M (g) := J gloggdp- (J gdp^j log (J gdpj . 

The best possible constant p above is denoted by pis = PLs(^,d, p). 

Definition. (Q.,d,p) is said to satisfy a spectral-gap (SG) inequality with constant p > 
(SG{p)) if: 

V/ e T pVar M (/) < j \Vf\ 2 dp , (1.2) 
where Var /i (/) denotes the variance of f: 



Var M (/) := J fdp- (Jfd»\ 



The best possible constant p above is denoted by psc = PSG(^,d, p). 

By linearizing the LS inequality around constant functions, it is easy to verify that 
PSG > Pls (see e.g. [28]). The spectral-gap psa controls the rate of convergence of an 
appropriate diffusion to the stationary measure p in the variance sense, whereas the stronger 
LS constant pis controls the rate of convergence in the entropy sense. By the tensorization 
property and the Bakry-Emery criterion, both types of inequalities are well-known to hold 
on products of spaces satisfying the corresponding inequality and on strongly convex spaces, 
respectively (see Appendix for details). 

One natural way of obtaining (lower) bounds on p^$ or psc-, is to start from a well- 
understood space (Q,d,p), and to transfer the LS or SG inequality from that space to 
a perturbation thereof. To this end, a transference principle or stability result for these 
inequalities is required. The most common type of perturbation is when the underlying 
metric space (fJ, d) remains fixed and only the measure p is perturbed, and this is the 
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case we will consider here. A very well-known transference principle for the log-Sobolev 
inequality is given by the Holley-Stroock lemma [21], which states that if m and m are 
two mutually absolutely continuous Borel probability measures on (O, d), then: 



dfi 2 


< L 2 , 


dm 


dm 


L°° 


dm 



PLs(^,d,m) > 



1 



L°° 



Li L 



1-^2 



-p LS (£i,d,m) 



(1.3) 



A completely analogous statement trivially holds for the SG inequality. Unfortunately, 
although being an extremely useful tool, a naive application of the Holley-Stroock lemma 
in a high-dimensional situation (consider simultaneously perturbing each single-site poten- 
tial) will typically lead to an exponential degradation of pls in the dimension, severely 
obstructing any hope of obtaining uniform bounds. 

Our aim in this work is to present several alternative general transference principles 
for the LS and SG inequalities, and as an application, to test their performance on a 
conservative spin model with and without weak-interactions. In this Introduction, we will 
put more emphasis on describing the application to conservative spin models, but before 
proceeding with this application, we briefly describe our general transference principles. 



1.1 The Transference Principles - A Brief Taste 

The Holley-Stroock perturbation principle (1.3) requires uniform upper and lower bounds 
on log(dm/dm)- Our transference principles rely on an upper bound on \\dml drnWip^) 
for some p > 1, allowing dm/ dm to vanish or to explode on a /ii-small part of the space. 
They also rely on geometric assumptions on the spaces involved: we require that (0, d) be 
given by a complete oriented smooth connected Riemannian manifold (M,g), endowed with 
its natural geodesic distance d, and so that (M,g,m) satisfies an appropriate curvature 
lower-bound condition. For the SG inequality, we require the curvature to be non-negative, 
whereas for the LS inequality we allow it to be bounded below by —k, but in addition 
require that pls{M, g, m) is big enough with respect to k. Our method is based on the 
equivalence between isoperimetric and concentration inequalities in the latter setting due 
to the second named author [38, 39], and extends the transference principles obtained in 
[38, 40]. Let us state now a sample result and refer to Section 2 for a more comprehensive 
account: 

Theorem 1.1 (log-Sobolev Transference under Curvature Lower Bound - Euclidean Set- 
ting). Let m = exp(— Vi(x)) dx (i = 1, 2) denote two Borel probability measures on Euclidean 
space (W 1 , 1 • |) ; and assume that V € C 2 (W n ) and that HessV2 > —Kid (k>0). Assume 
that (M n , I • \,m) satisfies a strong-enough log-Sobolev inequality: 

p = p L5 (M n ,|-|,Mi)>^-K, (1-4) 

p-1 

for some p > 1, and that: 
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Then 



where: 



\,P2) satisfies a log-Sobolev inequality: 

p L s(R n ,\-\,fi 2 ) >C(p,K,L, P ) 



C(p,K,L,p) := cp 



P 



1 



exp(-C(l + log(L))/0) 



1 



4pK 



p (p- l)p ' 

and c, C > are universal numeric constants. Moreover, when k = 0, one may in fact use: 

p-1 1 



C(p,0,L,p) =cp 



p 1 + log(L) 



(1.5) 



Note the logarithmic dependence on the perturbation parameter L in (1.5) when k = 
(i.e. when Vi is convex), improving over the linear dependence in (1.3). A similar result 
holds for the SG inequality when n = (and in fact much more may be said, see Subsection 
2.4). See Remark 3.10 for more on the constant 4 in (1.4). 

Obtaining dimension-free estimates of the malleable quantity \\dp2/dpi\\ LP ^ i ^ is already 
a more feasible task for many natural models, as we demonstrate in the case of the conserva- 
tive spin model. Furthermore, the convenience of controlling this quantity is especially ap- 
parent when superimposing several perturbations of p\. For instance, if p 2 = fdPi/ J fgdpi 
with fg > 0, then by several applications of Cauchy-Schwartz we may estimate: 



dpi 



dpi 



< 



n>(w) (/ ' fadpiY 

1/2 



< 



f 2p dp, 



1/2 



\ 1/2 

9 2p dpA 



f p dp, 



9 2p dp! 



1/2 



/ 2 dp: 



p/2 



9 2 dpij 



p/2 



;i.6) 



and so it is enough to analyze each perturbation (/ and g) of p\ separately. We illustrate 
this point by adding weak-interactions to our conservative spin model in Section 4. 



1.2 The Conservative Spin Model 

Let V 6 C 2 (IR) denote a single-site potential, so that p = exp(— V(x)) dx is a probability 
measure on M. with barycenter at the origin. Let p n := p® n = exp(—H(x))dx denote the 
Gibbs measure on M. n corresponding to the grand canonical ensemble of non-interacting 
spins, where H(x) denotes the non-interacting Hamiltonian H{x) = ^7=1^ ( x i)- ^ ^ ne 
well-known tensorization property of LS and SG inequalities (see Appendix), it follows that 
/9/(M n , | • \,p n ) = | • |) A*)> f° r I = LS, SG, and so the product space (W 1 , \ ■ \,p n ) is 

well- understood. Here and elsewhere, M n is equipped with its standard Euclidean structure 

The measure pe s , corresponding to the canonical ensemble having mean-spin s G R, is 
obtained by restricting p n onto the hyperplane E s given by 1 Y17=i Xi = s ' namely: 

VE a = exp(-H(x)) dvo\ Ee (x) , (1.7) 
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where we denote by vo\e s the induced Lebesgue measure on the hyperplane E s , and by 
Ze 3 > a normalization term. 

In [51], Varadhan asked to characterize those potentials V so that (E s , \ ■ |,//e s ) satisfies 
a SG inequality, uniformly in the system-size n > 2 and mean-spin s £ R. Varadhan's 
question naturally extends to the LS inequality case as well. When V is a C 2 perturbation of 
a (strongly convex) super-quadratic potential and under some mild technical assumptions, a 
positive answer to Varadhan's question for SG was obtained by Caputo [9]. A similar answer 
cannot be expected in the sub-quadratic case, as follows from the work of the first named 
author and Wolff. In [3], these authors showed that when \x is the exponential measure on 
R + (and more generally, when [i is the Gamma distribution), then psg(E s , \-\,/J>E a ) behaves 
like 1/s 2 (albeit independently of n), and that pls{E s , \ ■ \,Pe s ) behaves like 1/ns 2 . As for 
the corresponding questions in the LS case, various authors have addressed (using different 
methods) the case when V is a C 2 perturbation of a quadratic [24, 11, 19], starting from 
the work of Lu and Yau [33]. In particular, Grunewald, Otto, Villani and Westdickenberg 
analyzed the LS case in [19] using a two-scale (macroscopic - microscopic) approach, and 
established the system's convergence to the hydrodynamic limit. Building on this two-scale 
approach and extending it to a multi-scale one, Menz and Otto succeeded in [37] to obtain 
a positive answer to the LS version of Varadhan's question when V is a C 1 perturbation 
of a (strongly convex) super-quadratic potential. We have recently learned that Fathi and 
Menz (in preparation) have succeeded to reprove this result by using the original two-scale 
approach. 

Subsequently, Menz [36] obtained a positive answer for LS in the presence of weak- 
interactions between spins, when V is a C 2 perturbation of a quadratic potential. Given 
a symmetric n by n matrix A = {a>ij} with zero diagonal (a^j = and a^i = 0) and a 
vector b € R n , the spin-interaction term — Ia{x) and boundary contribution term B^x) are 
defined and added to the non-interacting Hamiltonian H as follows (see Section 5 for more 
details): 

n n 

HA,b{x) = H(x) + B b (x) - I A (x) , B b (x) = ^biXi , I A (x) := a^XiXj . 

i=l i,j=l 

The corresponding weakly-interacting conservative Gibbs measure (J>AE a b is defined as in 
(1.7) by replacing H with H A ,b] when b = 0, we simply denote it by ha,e s - 

1.3 Applying the Transference Principles 

In this work, we apply our transference principles to partially recover and partially extend 
the above mentioned results. Before stating a sample of our results, let us briefly describe 
our method. By a standard application of Cramer's trick, which is the key ingredient in 
the Cramer Theorem on Large Deviations - a central tool in the approaches of [19, 37, 36] 
- we reduce the analysis of the general mean-spin case s £ R, to the case s = 0. The idea 
is to replace fx with fi Aa := c a fiexp(ax), for an a = a(s) appropriately chosen so that the 
barycenter of at s, and to note that (E s , \-\,he 3 ) is isometric (as a metric- measure 
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space) to (E, | • |, (/j, a ^)E), where [i a is the translation of /i Aa having barycenter at 0, and 
where we denote E := Eq. This trick also applies in the presence of weak-interactions and 
boundary contribution, but there the argument is more subtle, and requires the existence 
of a solution to a system of non-linear equations, which is guaranteed by an application of 
Banach's fixed point theorem (see Lemma 5.4). 

Next, note that \ie is not absolutely continuous with respect to /x n , and so to be able to 
apply our transference results, we define he,w by "thickening" \xe uniformly in the diagonal 
direction D by a width of w > from each side: 

d^E,w{x) = -=— exp (-H(tt e (x))) l\ir D ( x )\< w dx , Z E , W = 2wZ E , 

where tte denotes orthogonal projection onto the subspace F. The measure HA,E,b,w is 
defined analogously, replacing H with HA,b above. 



1.3.1 Obtaining log-Sobolev inequalities 

By controlling \\d[iE,w/d[i n \\ Lv ^ s for e.g. p = 4 and applying Theorem 1.1, the LS inequal- 
ity is transferred from (W 1 , \ ■ \ ,fj, n ) onto (R n , | • \,HE,w) under appropriate assumptions on 
V; and from the latter space, which is a product of (E, \ ■ |, he) and the uniform measure on 
an interval of length 2w on D, the LS inequality is immediately established on (E, \ ■ \ , he), 
and it remains to optimize over w > 0. 

To control \\d/iE,w / 'd^nWip^ ) for say p > 0, we require upper estimates, exponentially 
decaying in u, of: 

Hn {x G R n ; \ir D (x)\ < w , H{x) - H(ir E {x)) > u} 

{ran ^ 
x G R n ; \ir D {x)\ < w , ^ V{ Xi ) - V(ir E (x)i) > u \ 
i=l i=l J 

x G R n ; 3t G [-w, w] , ^ V ( x i) ~ £ V ( Xi + - u \ ' 

i=l i=l J 

Applying Taylor's theorem and expanding V(xi + t/y/n) to the second order, it remains to 
control the large deviation of: 




i=i 



>«ij , p ^ it(Yi - E(y )) > ^2 



where Xi , . . . , X n and Y\ , . . . , Y n are independent random variables having distribution 
identical to that of (respectively) Xq and Yq := sup£ 6 pr _a t x +s\ f° r a fixed 5 > 0, 

where Xq is distributed according to fj,. Since E(V'(Xo)) = / V'(x) exp(—V(x))dx = 0, it is 
clear by the Central-Limit Theorem and the Law of Large Numbers that both terms above 
converge as n — > oo. However, to obtain estimates valid for each individual n, we resort 
to other classical large deviation tools such as Bernstein's theorem. Furthermore, if V" is 
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bounded below, then control over the second order term is automatic. Finally, to control 
the normalization term Ze, we apply the Berry-Esseen Theorem or a local version thereof. 

Superimposing weak-interaction on our model is also easily handled, by separately an- 
alyzing it using (1.6). To this end, we use a non-symmetric version of the Hanson-Wright 
order-two sub-Gaussian chaos deviation estimates [20], kindly communicated to us by Rafal 
Latala, to whom we are grateful (see Appendix). 

The following notation will be used throughout this work. Recall that the norm of 
a random-variable Y, denoted ||Y||r , is defined as follows: 

\\Y\\ L ^ := inf {A > ; Eexp(|Y|/A) < e} . 

Observe that || - is indeed a norm, thanks to the convexity of the exponential function, 
and that our normalization ensures that II 111 T = 1- Set: 

II "Ly 1 

• M p = M p (p) := K\X \p. 

• D 1 ^ 1 =D 1 , 9l (p) := \\V'(X )\\ L9i . 

• #2,*!=^ := l|Yo|| L ^. 

. D 2 = D 5 2 = D s 2 (jj) := E(Y ). 

Finally, we denote the dependence of a certain parameter a on other parameters (3,j,8,e 
as follows: a^'J means that a only depends on upper bounds on (3,e and on lower bounds 
on 5, e. 

We now provide a few samples to illustrate the types of results we obtain: 

Theorem 1.2. Let p = exp(— V{x)) dx denote a probability measure on M. with barycenter 
at so that Di^ l = Di^^p) < oo and A := \\dfi/dx\\ Lx < oo. Assume in addition that 
(R, | ■ |,/i) satisfies LSI(p), and that: 

-«:= inf V"(x) > -- . (1.8) 

Then for any integer n greater than r\ = r\ Mi l M ^ ' M 3 a3 ; the conservative zero-mean spin 
system (E, \ • |,/ie) satisfies LSI with constant: 

Pls{E, | ■ \,fi E ) > c-^ , 

where c, C > are universal constants and Q is the following scale-invariant quantity: 

Q := max(l, + Wl )M 2 ) . 

Theorem 1.3. Let p = exp(— V{x)) dx denote a probability measure on M. with barycenter 
at the origin, so that D\^ 1 = D\^ 1 (p) < oo and -D2,*i = D 2 ^ (p) < oo for some 6 > 0. 
Let A denote an n by n symmetric matrix with zero diagonal. Assume that (R, | • \,p) 
satisfies LSI(p), and that: 
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K : = inf V"(x) > -? ; 

zeR 8 



(1.9) 





1 ' ' 1 , the weakly-interacting con- 
Ha,e) satisfies LSI with constant: 



where c, C > are universal constants and Q is the following scale-invariant quantity: 



Here and throughout we denote the operator and Hilbert-Schmidt norms of A by 
and ||^4||#c, respectively. To demonstrate the desired uniformity in system-size n > 2, 
mean-spin s G K and boundary contribution b G R n for a concrete class of measures fi, we 
consider the following: 

Definition. The probability measure fx = exp(— V(x)) dx is called (a, /3, u) weakly Gaussian 
if we may decompose V = V conv + V per t so that: 



• SUp Vp er t — inf Vpert < W < OO . 

• — k := -|aexp(— u>) <V"<(3<oo. 

Theorem 1.4. Let fx be a (a,/3,w) weakly Gaussian measure, and let A denote an n by n 
symmetric matrix with zero diagonal satisfying: 



for an appropriate universal constant c > 0. Then the canonical ensemble (E s , \-\ ,HA,E a b) 
with weak-interaction A, satisfies a LSI, uniformly in the system size n > 2, mean-spin 
value s G K and boundary contribution b G M. n , depending solely on a positive lower bound 
on a and upper bounds on [3, uj and \\A\\ HS /(aexp(— oj)). 



Q := max (l, M 2 (D 2 + D^J) exp(P||^ s / p 2 ) . 



V conv , Vpert G C 2 (M). 



V" 



conv — 



> a > 0. 



A\\ < caexp(-w) 
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1.3.2 Obtaining Spectral-Gap inequalities 

We now turn our attention to the weaker SG inequalities. Unfortunately, our transference 
principle for the SG inequality only applies (at least in the Euclidean case) when the target 
measure is log-concave. Recall that a measure v = exp(— W(x)) dx on W 1 is called log- 
concave if W : W 1 -> 1U {00} is convex. We naturally consider a scenario where no LS 
inequality is possible, when the log-concave probability measure p = exp(— V(x)) dx does 
not have sub-Gaussian tail decay and the potential V is not necessarily strongly convex: 

Theorem 1.5. Let p = exp(—V(x))dx denote a log-concave probability measure on R. 
Assume that either: 

1. V is Lipschitz with constant L. 

2. V £ C 1 (IR) and V is Lipschitz with constant L? . 

Given s£l, denote p s := psgQ&i 
s E R: 

Psg(E s , I 

where c> is a universal constant 

We show in Section 6 that any log-concave probability measure satisfies the opposite 
inequality: psg(E s ,\ ■ \,Pe 3 ) < Cp s , for some universal constant C > 0. A well-known 
conjecture in convexity of Kannan, Lovasz and Simonovits [22] predicts that the logarithmic 
term in (1.10) and the restrictions on the log-concave measure p in Theorem 1.5 may be 
removed. In the special case of the two-sided exponential measure v = ^ exp(— |a;|)cfe, we 
are indeed able to resolve this conjecture, which translates into: 

Theorem 1.6. Let v = \ exp(— dx. Then the canonical ensemble (E s , | • |, ve s ) satisfies: 

Psg(E s , I • \,u Es ) ~ 1 , 
1 + s z 

uniformly in the system- size n > 2 and mean- spin s € M. 

Here and throughout, we use A ~ B to denote that c\ < A/B < C2 for some two 
universal constants ci,C2 > 0. Theorem 1.6 answers a question suggested to us by Pietro 
Caputo, whom we would like to thank, which served as motivation for this entire work. His 
question was inspired by the relationship between this specific canonical ensemble and a 
continuous version of the one-dimensional Solid-on-Solid model. See [10] for recent results 
and references. 

1.4 Comparison With Previous Works 

Theorem 1.1 should be compared with the Otto-Reznikoff criterion for LSI [46]. Roughly 
speaking, the latter result applies on a product space, where the i-th factor satisfies LSL(pi), 



\,p a ( s )). Then for any integer n > 2 and mean-spin 

\,VE s )>c- — Ps 2 2 , (1.10) 
log(2 + L z /p s y 
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and a smallness condition (relative to the {pi}'s) on the interaction between different factors 
is required, in the form of a lower bound on the cross- factors' Hessian. Our Theorem 
1.1, which applies to an arbitrary not-necessarily- product space, also requires a smallness 
condition on the lower bound of the Hessian of the target measure //2's potential, relative 
to the LS constant of the source measure However, this smallness is required not just 
for the "cross-terms" , but also for the "diagonal-terms" in some sense. 

Our approach to functional inequalities for conservative spin systems, based on trans- 
ference principles, is novel and has several advantages. Firstly, it follows closely the natural 
heuristics: if J xdfj,{x) = and the Xj's are independent copies distributed according 
to n, then by the Law of Large Numbers and Central Limit Theorem, conditioning on 
Y^i=i Xi = should be less and less stringent as n increases, and hence the measure \xe 
is expected to behave similarly to the product measure jjL n \ our proof precisely mimics this 
heuristic argument. Secondly, our approach is global and allows to deduce estimates on 
the ergodic constants for an individual value of the mean spin s which hold uniformly in 
the dimension of the system. In contrast, previous methods relied on a kind of induction 
on sub-systems, which seems to work well only when the ergodic constants are bounded 
independently of the dimension n and the spin s. For this reason, they could not be applied 
to sub-quadratic potentials, where a dependence in s is expected. Finally, our approach is 
soft and flexible. It adapts to weak interactions and is likely to be useful for other models. 

Next, let us comment on our results in comparison to previous ones. Our spectral-gap 
estimates for the conservative spin-model with convex self-interactions are new and almost 
sharp. Apart from the work [3] on log-concave gamma distributions, which used very specific 
properties of these laws, our results are the first dimension-free estimates for potentials with 
a sub-quadratic growth. 

Clearly, our results on the LS inequality for the conservative spin-models, being based 
on our general transference principle, do not fully recover the best known results described 
earlier. Our main limitation lies in the requirements (1.8) and (1.9) that the LSI constant of 
(K, I ■ I , fi) is strong-enough relative to the curvature lower-bound satisfied by the potential 
V. Consequently, we cannot handle the case of an arbitrarily large C 2 perturbation of a 
quadratic potential as in [33, 24, 11, 19], let alone C 1 perturbations as in [37]. However we 
are able to handle any weakly Gaussian measure, i.e. small C 2 perturbations of not just a 
quadratic potential as in [33, 24, 11, 19, 36], but of any potential V with < a < V" < f3 < 
00. Furthermore, if no upper bound on V" is assumed, we can still handle the zero mean-spin 
case under a very mild technical assumption (essentially, that V grows sub-exponentially); 
handling arbitrary mean-spin values via our method seems technically involved, but not 
impossible. Therefore in the non-interacting case, we propose a soft and robust approach 
to known results. Our approach avoids in particular the delicate and technical proofs of 
local versions of Cramer's Theorem, which are essential ingredients in the approaches of 
[19, 37, 36], and which must be reproved for every small variation of the underlying model 
(and in some cases, as in the weakly-interacting model of [36], are obtained by an indirect 
perturbation argument). 

In the presence of weak- interactions, we partially recover and partially extend the work 
of Menz [36], which deals with arbitrary C 2 bounded perturbations of a quadratic potential. 
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Passing to a more general potential V (even one with < a < V" < (3 < oo) is a genuine 
issue for the approach of [36], due to the perturbative nature of the argument. In contrast, 
we can also deal with weakly Gaussian potentials if the interaction is small enough. Fur- 
thermore, the result of [36] requires that || \A\ || be small-enough, where \A\ := {|a£j|} for 
A = {cjjj}. Our estimates only require the smallness of ||.A|| , which is always smaller (in 
the worst case by a factor y/n): 

\\A\\ op < || \A\ || op < || \A\ \\ HS = \\A\\ HS < ^ U\\ op . 

However the constants appearing in our estimates also depend on the value of ||^4.||#s- 
So this advantage would only be apparent in mean-field type situations, or at least when 
\\A\\ HS I \ A || is upper-bounded independently of the dimension. 

To summarize, we present in this work a single framework for simultaneously handling 
several spin-models, which may be easily extended to other models or scenarios. All of 
our estimates are stated in terms of explicit dependencies on several concrete parameters 
extracted from the initial single-site measure \i. 

The rest of the paper is organized as follows. Section 2 presents our transference prin- 
ciples for the LS and SG inequalities. In Section 3, we deduce a uniform bound in the 
system size on the LSI constant of a conservative spin system with zero-mean spin, under 
various assumptions on the self-interaction. Section 4 achieves similar results for a weakly- 
interacting model. In Section 5, we extend the uniformity of our bounds for both models 
to arbitrary mean-spin values. In Section 6, we turn our attention to the SG inequality 
when the self-interaction is assumed (non-strictly) convex. We obtain uniform bounds on 
the system size, whose dependence on the mean-spin is essentially sharp. The Appendix 
collects various useful auxiliary statements. 

Acknowledgements. We thank Rafal Latala and Sasha Sodin for their help with refer- 
ences and estimates, and Pietro Caputo, Georg Menz and Cedric Villani for their comments 
and interest. E.M. would also like to thank Thierry Bodineau and Tom Spencer for intro- 
ducing him to Spin Systems. 

2 The Transference Principle 
2.1 Concentration Transference 

Let ($7, d, fi) denote a measure-metric space, meaning that (fi, d) is a separable metric 
space and fi is a Borel probability measure. Given such a space, its concentration profile 
K = K(Sl,d,/j) : R+ ->■ [0, 1/2] is defined by: 

K(r) := sup {^(O \ A*) ; p(A) > 1/2} , A d r := {x e £1 ; d(x, A) < r) . 

It is immediate to check that this is equivalent to requiring that 1C is minimal with: 

fi(A) > K{t) => fi(Af) > 1/2 . 
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Using the "pessimistic" convention for the inverse of a non-increasing function: 

KT^e) := inf{r > ; /C(r) < e} . 

it follows that: 

fj,(A)>s => n(A$)>l/2 for r = Kr\e) . (2.1) 

Concentration inequalities provide upper bounds on the decay of IC(r) in the large-deviation 
regime, as r — > oo. Note that trivially /C(r) < 1/2 for all r > 0, and so the concentration 
inequality IC(r) < a(r) is only meaningful when r > a~~ 1 (l/2). 

The following crucial lemma is a generalization of [40, Lemma 3.1]. 

Lemma 2.1 (Concentration Transference). Let fJ,i,fJ, 2 be two probability measures on a 
common metric space (0, d), with fi 2 "C /ii- Let JCi = IC^q ^^ denote the corresponding 
concentration profiles. Assume that there exists a right- continuous non-increasing function 
M : (0,1/4] (0,oo) so that: 

Ve G (0, 1/4] , /x 2 (jx E 0; |^(x) > M(e)}) < e. 

T/ien /or all r > 2/C^ 1 (/3(l/4)), 

/C 2 (r) <2/3" 1 (/C 1 (r/2)) , (2.2) 

where (3 : [0,1/4] — > [0,/3(l/4)] denotes the increasing function (3(e) := e/M(e), with the 
convention that (3(0) = 0. 

Proof. Let A C ft with /x 2 (^4) > 1/2. Given e E (0, 1/4], set 

ft £ := jx E ft; ^-(x) < M(e)\ . (2.3) 

By hypothesis ^ 2 (VL £ ) > 1 — e. Denote A 6 := A n ft e , and observe that //2(^4 e ) > 1/2 — e 
and hen ce /xi(^ £ ) > 5 e := (1/2 - e)/M(e). Denoting r = /C^ 1 ^), it follows from (2.1) 
that n\((A £ ) ri ) > 1/2, and hence for any r > 0: 

A*i(n e \ (^ £ ) n +r) < \ (^%+r) < /^(r) . 

Using (2.3) again, we obtain: 

» 2 (n e \ A ri+r ) < /i 2 (ft £ \ (A e ) ri+r ) < M(e)K 1 (r) , 

and thus: 

fi 2 (Q \ A ri+r ) < fi 2 (n\n £ ) + fi 2 (n £ \ A ri+r ) < e + M(e))C 1 (r) . 
It follows that for all e E (0, 1/4], 

K 2 (r + Ki 1 (^I^X\ < e + M(e)/Ci(r) Vr > , Ve E (0, 1/ 4] . (2.4) 



12 



Now given r > 2X:^ 1 (/3(l/4)), we set e := ft- 1 {Ki{r / 2)) £ [0,1/4], ensuring (by the 
right-continuity of /3) that Eo/M(so) > K\{r/2). Since: 

we obtain from (2.4): 

IC 2 (r) <e + M(e )/Ci (r - /C^ 1 f ^")" )) < £ o + M(e )/C 1 (r/2) < 2e , 

and (2.2) follows. □ 

When M(e) = M for all e £ (0, 1/4], that is when d^/d^i is essentially upper bounded, 
the above concentration transference result appeared in [40, Lemma 3.1] with better numer- 
ical constants. However the previous lemma adapts to cases when the density is unbounded. 

Proposition 2.2 (Concentration Transference - Integral Form). Let \x\, fi2 be two proba- 
bility measures on a common metric space (Q,d), verifying [12 <C Let KLi = /C^d,^), 
i £ {1,2} be their concentration profiles. Let G : —> denote a continuous function 
increasing to infinity, and set F(x) = xG(x) on R+. Assume that: 

Then: 

JC 2 (r) < 2/Ci(r/2)F" 1 (L//Ci(r/2)) Vr > . 
Proof. By the Markov-Chebyshev inequality: 

It follows that the hypothesis of Lemma 2.1 is satisfied with M(e) := G~ 1 (L/e). Observe 
that: 

F(M(e)) = M{e)G(M(e)) = , 
where as in the lemma, /3(e) := e/M(e). Consequently: 

P~\x) = M-\F-\L/x)) = —JL— = xF-\L/x) , 

and the conclusion of Lemma 2.1 yields the desired assertion. □ 
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2.2 Concentration Vs. Isoperimetry 

In the "semi-convex setting" , when an additional lower-bound assumption on an appropri- 
ate (generalized Ricci) curvature associated to the target space (Cl,d,/i2) is imposed, it is 
possible, repeating the program put forth in [40], to utilize the results of [38, 39] on the 
equivalence between isoperimetric and concentration inequalities in that setting, and trans- 
late the transference principle obtained above from the concentration to the isoperimetric 
level. As a by-product, a transference principle may be deduced for Sobolev-type inequal- 
ities in the semi-convex setting. We will only briefly describe this program here for the 
log-Sobolev and spectral-gap inequalities, as these seems the most interesting and relevant 
ones for applications, and refer to [40] for results on the transference (or stability) of general 
isoperimetric and Sobolev-type inequalities and for missing details. 

Recall that Minkowski's (exterior) boundary measure of a Borel set A G CI, which we 
denote here by fi + (A), is defined as fi + {A) := liminf £ _>o ^ A ^~^ A ^> ; where A £ = Af := 
{x E Q; By £ A d(x, y) < e} denotes the e extension of A with respect to the metric d. The 
isoperimetric profile I = I^fi,d,fi) is defined as the pointwise maximal function I : [0, 1] — > 
K+, so that n + (A) > I(/j,(A)), for all Borel sets A C Cl. An isoperimetric inequality 
measures the relation between the boundary measure and the measure of a set, by providing 
a lower bound on Since A and Q\A will typically have the same boundary measure, 

it will be convenient to also define X : [0, 1/2] — > M + as I(v) := min(X(v),X(l — v)). 

The two main differences between isoperimetric and concentration inequalities are that 
the latter ones only measure the concentration around sets having measure 1/2, and only 
provide large-deviation information on the measure of extensions of these sets, contrary to 
the infinitesimal information provided by the former ones. We refer to [28, 43, 40] for a 
wider exposition on these and related topics and for various applications. 

It is known and easy to see that an isoperimetric inequality always implies a concentra- 
tion inequality, simply by "integrating" along the isoperimetric differential inequality (see 
e.g. [40]), but the converse implication is in general false, due to the possible existence of 
narrow "necks" in the geometry of the space (£1, d) or the measure [i. However, when such 
necks are ruled out by imposing some semi-convexity assumptions on the geometry and mea- 
sure in the Riemannian-manifold-with-density setting, it was shown in the second named 
author's previous work [39] (see also [42] and [30]) that general concentration inequalities 
imply back their isoperimetric counterparts, with quantitative estimates which do not de- 
pend on the dimension of the underlying manifold. Semi-convexity thus serves as a bridge 
between the large-deviation and infinitesimal extension scales. The precise formulation is 
as follows. 

Definition. We will say that our smooth K-semi-convexity assumptions are satisfied (k > 
0) if: 

• (f2, d) is given by a complete smooth oriented Riemannian manifold (M, g) with its 
induced geodesic distance. 

• fx is supported on the closure of a geodesically convex domain S C M with (possibly 
empty) C 2 smooth boundary, on which dfi = ex.p(—t/))dvolM\s with tp € C 2 (S), and 
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as tensor fields on S: 

HiCg + HesSgV > — Kg ■ 

We will say that our K-semi-convexity assumptions are satisfied if \x can he approximated 
in total-variation by measures {/U m } so that each (Q,d,fJ, m ) satisfies our smooth K-semi- 
convexity assumptions. 

When k = 0, we will say that our convexity assumptions are satisfied. 

Here Ric 5 denotes the Ricci curvature tensor of (M,g), Hess 9 denotes the Riemannian 
Hessian, and voIm denotes the Riemannian volume form. Ric 9 + Hess 9 '0 is the well-known 
Bakry-Emery curvature tensor, introduced in [32] and developed in [1] (in the more ab- 
stract framework of diffusion generators), which incorporates the curvature from both the 
geometry of (M,g) and the measure [i. When ip is sufficiently smooth and S = M, our k- 
semi-convexity assumption is then precisely the Curvature-Dimension condition CD(—k,oo) 
(see [1]). An important example to keep in mind is that of Euclidean space (M n , [•[) equipped 
with a probability measure exp(— ip(x))dx with Hess tp > —Kid. 

Theorem 2.3 ([39]). Let k > and let a : M+ —> MU{+oo} denote an increasing continuous 
function so that: 

35 > 1/2 3r > Vr > r a(r) > 5 Kr 2 . (2.6) 
Then under our k- semi- convexity assumptions, the concentration inequality: 

/C(r) < exp(-a(r)) Vr > , 

implies the following isoperimetric inequality: 

~ x 
X(v) > min(c,5 vj(log 1/v), c K a ) \/v G [0,1/2] , where -f(x) = — t—^ , (2.7) 

a 1 (x) 

and c<5 ,c K)Q , > are constants depending solely on their arguments. Moreover, if k = 0, 
we may take c$ Q = c and co iQ = |7(log4) for some universal constant c > 0. If n > 0, the 
dependence of c K)Q on a may be expressed only via 5q and a(rg). 



2.3 log-Sobolev Transference under Curvature Lower Bound 

Theorem 2.4 (log-Sobolev Transference under Curvature Lower Bound). Let fi2 *C /^i 

denote two Borel probability measures on a common Riemannian manifold (M,g), and 
assume that (M,g,/i2) satisfies our k- semi- convexity assumptions (k > 0). Assume that 
{M,g,fxi) satisfies a strong-enough log-Sobolev inequality: 

p = p LS (M,g, t i 1 )> —^-k , (2.8) 
p — 1 

for some p > 1, and that: 

'd^ 



dm < L p . (2.9) 

"Mi. 
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Then (M, g,fJ*2) satisfies a log-Sobolev inequality: 

p LS (M,g,p 2 )>C(p,K,L,p) , (2.10) 

where: 

C(p,K,L,p) :=cp^ exp(-C(l+log(L))/0) , := 1 



P {p- VP 

and c, C > are universal constants. Moreover, when k = 0, one may in fact use: 

p-1 1 



C(p,0,L,p) = cp 



p 1 + log(L) 



Proof. Let us denote the isoperimetric and concentration profiles on the corresponding 
spaces by 2, = Ii(M, g,Pi) and /Q = ICi(M,g,pi), i = 1,2, respectively. By the Herbst 
argument (see Appendix), it is known that the log-Sobolev inequality (2.8) implies the 
following Laplace-functional inequality: 

J exp(Xf)dpi < exp(A 2 /(2p)) VA > V 1-Lipschitz / s.t. J fdp x = . 

By passing from expectation to median in the above requirement that J fdp\ = (employ- 
ing the Markov-Chebyshev inequality), and from 1-Lipschitz functions to their level-sets, 
it is easy to check (see e.g. [40, Lemma 4.2]) that this implies the following concentration 
inequality on (M,g,pi): 



/C^expl-lfr-W^I | Vr>0. 



Using Proposition 2.2, we deduce that: 



/C 2 (r) < 2 J £X 1 (r/2) 1 - 1 /p < e xp |tog(2L) " ^ - ]r - ^^J^J I Vr > . 

Since 5q := P % p ^ / k > 1/2 by assumption, K 2 clearly satisfies the decay condition (2.6) 
required to apply Theorem 2.3: 

: = + X / 2 ) > 2 3r o = r o( p ' K ' L ) Vr - r '° ^ 2 ( r ) - ex P(-^o Kr2 ) • 
Consequently, Theorem 2.3 implies that the following isoperimetric inequality is satisfied: 

l 2 (v) > min [ c(5' n )y/pv l0g ^ ,c 

^ ^log 1/w + log(2L) + 2^21512 



> </(p,K,L,p)vy/logl/v VuG [0,1/2] 
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(note that by Jensen's inequality applied to (2.9), L > 1). This means that (M, g, P2) 
satisfies a Gaussian isoperimetric inequality. As described in [40] in greater detail, it is 
known by a result of M. Ledoux [26], refined by B. Beckner (see [27]), that this implies the 
log-Sobolev inequality (2.10) with C(p, k, L,p) = c(c' (p, k, L,p)) 2 , with c > a universal 
constant, concluding the proof. Note that when k = 0, the remarks at the end of the 
formulation of Theorem 2.3 imply that one may use: 



above, with c > a universal constant. When k > 0, a tedious inspection of the estimates 
obtained in the proof of Theorem 1.2 in Section 5 of [39] verifies the asserted estimate on 



2.4 Spectral-Gap Transference under Non-Negative Curvature 

First, we state transference principles which were obtained by the second named author. 
In their most general form they involve total variation estimates. Recall that the total- 
variation distance dxv between two absolutely continuous probability measures p\,p2 on 
(M, g) is defined as: 



For notational simplicity, we formulate the results of [38] in the Euclidean setting and refer 
to that paper for the general case. Transference results with respect to the 1-Wasserstein 
and relative-entropy distances are proposed in [40]. 

Theorem 2.5 ([38]). Let p\ and P2 denote two log-concave probability measures on Eu- 
clidean space (W 1 , 1 • |), meaning that pi = exp(— Vi(x)) dx with Vi : M n — > IRUj+oo} convex. 
Assume that either of the following conditions is satisfied for some L E (1, +00): 

1. P2 <C pi and \\dp2/dpi\\ Loo < L; or, 

2. pi <S P2 and \\dp\/ dp2\\ Loo < L; or, 

3. d TV (pi,p 2 ) < 1 - — . 

Then the i-th assumption (i = 1,2,3) implies: 




C(p, k, L,p), concluding the proof. 



□ 





P2) > Ci(L) 2 ps G (^ n , 



where, for some universal constant c > 0: 



Ci(L) 



c 



C 2 (L) = -1 , C 3 (L) 



c 



(2.11) 



1 + log(L) 



L2(l + log(L)) 
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In order to compare the above assumptions it is worthwhile to record: 



Lemma 2.6. Let \ii <C /Ui <C voIm be Borel probability measures on a Riemannian manifold 
(M,g). If^S^>D)<5, then d TV {^^) < 1 



d[i\ J ' max(l,D) 



Proof. 



1 - drvfalto) = J min {J^-, dvol M = | 



mm 




max(l, D) 

□ 



Consequently the hypothesis of Case 1 (or 2) implies the one of Case 3. However, 
applying Case 3 of Theorem 2.5 gives the poorest dependence in L as Cs(L) is much smaller 
than Ci(L) and C^-L). On the other hand, the total variation distance is robust and easy 
to control. For instance if we assume that \ii <C /ii are log-concave on W 1 and verify 
J (dfi2/ 'd^i) p d^i < L p , then by the Markov-Chebyshev's inequality, for any D > 0: 

Consequently, we have by Lemma 2.6: 

/ 1/ £ p \ \ P - 1 

drv(vi,V2) < mf 1 - — 1 



D>1 \ D \ Dp- 1 J J ( p L)p/(p- 1 ) 

Therefore applying Case 3 of Theorem 2.5 yields that the spectral-gap of is at least that 
of \i\ divided by a power of L. This polynomial dependence would not be good enough 
for the applications we have in mind. The goal of the next theorem is to provide a new 
quantitative transference principle for the spectral- gap, which generalizes Case 1 of Theorem 
2.5 (corresponding to the case p = oo below): 

Theorem 2.7 (Spectral-Gap Transference under Convexity Assumptions). Let \ii <C fii 

denote two Borel probability measures on a common Riemannian manifold (M,g), and 
assume that (M,g,/i2) satisfies our convexity assumptions. Assume that for some p > 1: 

^] P d^<LP. (2.12) 

dm J 

Then: 

p S G(M,g,fi 2 ) > C(L,p) 2 p5G(M, 5 , / ui) , 

where: 

C(L,p) := c V — / , (2.13) 
p 1 + log(L) 

and c > is a universal constant. 
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Proof of Theorem 2.7. Set p = pga(M, g, K.\ = K.{M,g,p\) and /C2 = IC(M, g, ^2)- By 
a result of M. Gromov and V. Milman [17] (see also [38, Corollary 2.7]), a spectral-gap 
inequality always implies the following exponential concentration: 

JCi(r) <exp(-cy/pr) Vr > , (2.14) 

where c > is a universal constant. Using Proposition 2.2, we deduce from (2.12) that: 

JC 2 {r) < 2L/Ci(r/2) 1 " 1 / p < 2Lexp (" c ^^V^ r ) Vr > . 

Using our convexity assumptions on (M,g,p2), Theorem 2.3 implies that the following 
isoperimetric inequality is satisfied: 

n-(A/r \( \ ^ n P~ l ■ ( logl/w 

p V l°g l/ w + l°g( 2 ^) 

> dy/p *' 1 1 i; V«€ [0,1/2] 
p 1 + log(L) 

(note again that L > 1 by Jensen's inequality). This means that [M,g,p 2 ) satisfies a linear 
(or Cheeger-type) isoperimetric inequality. As described in [40] in greater detail, it is known 
by results of Maz'ya [34, 35] and independently Cheeger [12] that this implies a spectral-gap 
inequality: 

, — 1 l(M,g,p2)(v) . dp -I 1 

2 «e(o,i/2] u 2 p 1 + log(L) 

as asserted. □ 

We can actually prove a result stronger than Theorem 2.7, using stronger results from 
[38, 39, 41]. However, in practice a convenient way to check the hypothesis of the next 
theorem would be to verify the condition of the previous one. 

Theorem 2.8. Let p>2 pi denote two Borel probability measures on a common Rieman- 
nian manifold (M,g), and assume that (M,g,p2) satisfies our convexity assumptions. Let 
L > 0. There is a universal constant c > such that 

^{^ >L }-1 PSG{M,g,p2)>^ f:f PsG{M,g,p 1 ). 

Proof. Let us adopt the notations of the previous proof. First note that our hypothesis 
ensures L > 7/8. Next we apply (2.4) for e = | and M(e) = L, to get for any r > 0: 
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Applying this to r = log(8L) / {c^J~p) and using exponential concentration for \i\ in the form 
of (2.14), we obtain: 

< \ , n := -L,„ g ( 8i ) +Cr ' (A) < A.,„ g(8 L) . 

At this point, we use a much stronger result than Theorem 2.3 for transferring concentration 
to linear isoperimetry when the convexity assumptions are satisfied, namely (see [38, 39, 41]): 

. j{M,g,» 2 )(v) 1 - 2/C 2 (r) 

mi > sup ■ 



u6(0,l/2] V r> Q r 

Applying this inequality for r 2 , and arguing as in (2.15), we obtain: 

, — 1 . i(M,g,fi 2 )(v) 1 c^p 

2 i)6(o,i/2] v 4r 2 81og(8L) 



□ 



3 The Conservative Spin Model 

Let [x = exp(—V(x))dx denote a probability measure on M, and let Xq be a random variable 
on E with law [i. Let {Xi} i=1 n be independent copies of Xq, so that X = (Xi, . . . ,X n ) 
is a random-vector in M. n distributed according to: 

Hn := LL® n = exp(-H(x)) dx , 

where H{x) denotes the non-interacting Hamiltonian: 

n 

H{x) = Y J V{*i) ■ 
i=i 

The measure \i n is the Gibbs measure corresponding to the grand canonical ensemble of 
non- interacting spins. We obtain from it the measure lie corresponding to the canonical 
ensemble, by conditioning the mean-spin S = ^ -^i ^° be 0, namely: 

He = TT- exp(-H(x)) dvol E (x) , 

where we equip M. n with its standard Euclidean structure | • |, denote by vol^ the induced 
Lebesgue measure on the hyperplane E = Eq given by YH=\ x i = 0) an d denote by Ze > 
a normalization term. Note that Ze is the density at of pP ', the push- forward of p n by 
the orthogonal projection on the diagonal line D spanned by the vector (1/y/n, . . . , 1/y/n). 
We denote by np the orthogonal projection onto the subspace F. 
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Note that //e is not absolutely continuous with respect to fj, n , so we define fJ,E,w by 
"thickening" it uniformly in the diagonal direction by a width of w > from each side: 

d^E,w{x) = exp (-H(tt e (x))) l^ D ( x \\< w dx , Z E , W = 2wZ E . 

Recall from the Introduction that we denote M p = M p (fj,) := M\Xq\ p and set Di^ x = 
:= II^'(^o)||l w j where ||Y|| L ^ denotes the norm of a random-variable Y. 
Given 5 > 0, we set Y := sup^ [Xo _ StXo+s] \V"(£)\, and denote D 2> <$ 1 = = D^O-O : = 

||Y || Lwi and D 2 = D s 2 = D 6 2 {y) := E(Y ). 

A central tool we will use in this section is the following large-deviation bound of 
Bernstein-type (e.g. [31]). 

Theorem 3.1 (Bernstein). Let Y±, . . . ,Y n denote a sequence of independent random-variables 
on R, with E(Yj) = and \\ Yi \\ L < for each i = 1, . . . , n. Then there exists a universal 
numeric constant c > 0, so that for any vector a = (oi, . . . , a n ) and t > 0: 



ajYi > t) < exp — cmin 



t 2 



3.1 Concentration Transference from fi n to /je.w - One-Sided Approach 

In this subsection, we present a first way to transfer concentration estimates from (i n to 
He,w It only requires controlling the second derivative of V from one side. To this end, we 
need the following local version of the Berry-Esseen Theorem (see [48]), and in fact, only 
at the center point x = 0: 

Theorem 3.2. Let X\, . . . ,X n denote i.i.d. copies of the random variable Xq, having 
absolutely-continuous law fi. Assume that M(Xq) = 0, that M 3 = M^fj,) < 00 and that 
A := \\dfi / dx\\ Loo < 00. Then for any n > 1, the density p n (x) of Y^=\ %i satisfies: 



sup 



/ X 2 



p n (x) exp 



C ( M 3 w o\ 1 

< -7= max -^,M 3 A 3 1 



n 



jWM 2 V 2M 2 
for some universal constant C > 0. 
Proposition 3.3. Let p£R, and assume that: 

• V € C 2 (R) and sign(p)V" > —k, for some k > 0. 
. := \\V'(X )\\ L9i <oo. 

Then for any w > and integer n greater than r/ = r/l p l w£,1 >*i , the following estimate holds: 

/ exp(p(H(x) - H{-k e {x))) l\n D (x)\<w dfJ*n(x) <Cexp{Cw 2 {\p\n + p 2 Dl^ 1 )) , 
for some universal constant C > 1. 
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Proof. Assume that p > 0, it will be evident from the proof that the case that p < is 
treated identically, after exchanging H{X) and H(tte(x)). We write: 



exp(p(H(x) - H{i: E {x))))l\^ x) \< w dix n (x) 

/oo 
pex.p(pu)fi n {x G M. n ; \7rrj(x)\ < w , H(x) — H(tte(x)) > u} du 
-oo 

POO 

< e pu ° + / pe pu fx n {x G R n ; |vr D (x)| < w , F(x) - H(tt e (x)) > u} du , (3.1) 
J uo 

for some uo to be determined. We proceed by roughly evaluating the integrand as follows: 

Hn {ieK"; \ttd(x)\ < w , H(x) - H(ir E (x)) > u] 

( n n > 

= fi n \ x G R n ; \ir D (x)\ < w , ^ V( Xi ) - V(ir E (x)i) > u \ 



i=l 
n 



i=l 
ii 



< fi n I x G R n ; 3t G [-w, w] , ^ Vfa) - Vfa + t/y/n) > 



i=l 



i=l 



Applying Taylor's theorem and using in addition that V" > — /c, we deduce that: 
/i n {x G R n ; |vtd(x)| < w , fl"(x) - H(tt e (x)) > u} 



< /i n < x G R n ; 3t £ [-w, w] , --^Y j V'{x i ) + t - 

n 



K> U 



W 



i=i 



w 

> U K 

2 



(3.2) 



It is easy to verify that EV'(X ) = V'(x) exp(-V(x)) dx = - J™ x dexp(-V(x)) = 0. 
Consequently, whenever u > uq and n tends to infinity, the expression in (3.2) will be 
governed by the Central-Limit Theorem, in accordance with the heuristic argument from 
the Introduction. To obtain exponentially decaying estimates on (3.2) as u — > oo for each 
individual n, we apply Theorem 3.1 , which together with (3.1) yields: 

/ exp (p(H(x) - H(tt e {x))) \ D (x)\< w dn n (x) 

, s , v f°° ( ( Vnu 

< exp(pno) + 2pexp(puQ) / exp pu — cmin — — 

Jo \ ywDi^ 1 

When n is large enough, the above estimate is clearly bounded by: 

,2 \ 



w 2 D 2 



du . 



< 2exp(puo) ^1 + p J exp i^pu — c 



in 2 D 2 
2„..2 t~i2 



du 



< 2exp(pu ) ( 1 +py / 7r/cwD 1 ^ 1 exp(p w D 1 ^ 1 /(4c))J . 



22 



Recalling the definition of uq and using that l+exp(x 2 )x < C exp(Cx 2 ) for an appropriately 
chosen C > 1, we obtain that the latter quantity does not exceed: 

as asserted. □ 

Corollary 3.4. In addition to the assumptions of Proposition 3.3 (for positive p), assume 
that the barycenter of fi is at the origin, that M3 = M^fj,) < 00, and that A := \\dfj,/ dx\\ Laa < 
00. Then setting: 



w := A min M 2j — — , (3.3) 



the following estimate holds: 

1/4 



n \ a/x n J J V 



/or any integer n greater than rj M ' A / M ^ ' M 3^ 3 j where C > is a universal constant. 
Proof. Recall that: 

d/J,E,w(x) = -^—exp(-H(ir E (x))) h nD ( x )\< w dx . 

^E,w 



Consequently, applying Proposition 3.3 with p = 4, we know that for any integer n greater 
than 77 wDl >*i : 

Recall that Ze,w = 2wZe and that Ze is the density at of ~^Y^i=i Xi- It follows by 
Theorem 3.2 that Ze > n whenever: 

for an appropriately chosen universal constant C > 0. Plugging this estimate into (3.4), 
we obtain: ^ 

(f {^XdA <C"^eMC W \K + Dl^)), 

yj R n \ dUn J J W 

and the asserted estimate follows when setting w = Wq. □ 
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3.2 Concentration Transference from [i n to \i E ^ w - Two-Sided Approach 

In this subsection, we present a variant of the procedure carried out above. It only requires 
the following version of the classical Berry-Esseen Theorem (e.g. [48]): 

Theorem 3.5 (Berry-Esseen). Let X\, . . . ,X n denote a sequence of independent random- 
variables on R, with E(Xj) = and E(\Xi\ 3 )/E(Xf) 3 / 2 < M for each i = 1, . . . ,n. Let Z 
denote a standard Gaussian random variable on R and denote: 

>->n — 



/Er=i E ( x f 

Then: 

VieR |P(5„ < t) -¥(Z < t)\ < C 



jn 

for some universal constant C > 0. 

Proposition 3.6. Let fj, = exp(—V(x))dx denote a probability measure on R, and assume 
that: 

• V G C 2 (R) and lim^-too V(x) = +oo. 

• Di^j = D\^ 1 (fi) < oo, and there exists 5 > so that D 2 ^ 1 = D^y AfJ.) < oo. 

Then for any p G R, w > 0, and integer n greater than n = r j™ Dl > st i'' w D2 ^i' w '\ p \ ^ ^ e 
following estimate holds: 

/ exp (p(H(x) - H(tt e (x)))) l\ %D i x )\< w dfi n {x) < Cexp (Cw 2 (\p\D 2 +p 2 Dl^ i )) , 

for some universal constant C > 1. 

Proof. Let us assume that p > 0, it will be evident from the proof that the case that p < is 
treated identically, after exchanging H(x) and H(ir E (x)). Repeating verbatim the relevant 
parts of the proof of Proposition 3.3, we verify that: 



exp(p(H(x) - H(ir E (x))))l\n D ( x )\< w dfj, n (x) 

/•oo 

< e puo + / pe pu fi n {x £ R n ; \ir D (x)\ < w , H(x) - H(ir E (x)) > u} du , (3.5) 

J u 

for some no £ R to be determined, and that: 

fi n {x £ R n ; \n D (x)\ < w , H(x) - H(tt e (x)) > u} 



, \ w 

< fi n < X e 

in 



i=l 



2 n 

+ su p _\v"m\>u 

i=l £,i£[xi-w/^n.,Xi+w/^n] j 
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When n is greater than (w/5) 2 , we evaluate this by: 



w 



t=l 



+ 



W 

2n 



sup 



< 



w 



i=l 



I \£i€[xi—5,Xi+S\ 
2 



Do\>u 



D 2 u? 



i=l 



U — Uq 



(3.6) 



where Y\,...,Y n denote independent copies of Yq and uq := D 2 w 2 /2. 

Observe that by definition E(Yo — D2) = 0, and that EV'(-Xo) = as before. Conse- 
quently, whenever u > uq and n tends to infinity, the first and second term in (3.6) will 
be governed by the Central-Limit Theorem and Law of Large Numbers, respectively, in 
accordance with the heuristic argument from the Introduction. To obtain exponentially 
decaying estimates as u — > 00 for each individual n, we apply Theorem 3.1 to each of the 
terms, which together with (3.5) yields: 



exp(p(H(x) - H(tt e (x))) l\T, D {x)\< w dn n {x) < exp(pu ) 

,2 \ \ 



, , f°° ( , ( y/nu 

+ 2pexp(pu ) / exp pu - c min - — , 

Jo \ \2wD lj9l 

+ pexp(puo) J exp \vu — c'n min 



u 



du 



u 



w 2 (D 2 ^ 1 + D 2 ) ' ^(A^ + D 2 ) 2 



du , 



where we have used that ||Yo ~~ Azllx^ < II^oIIl^ + 11-^2 IIl = ^2,*i + D 2 . We see that 
when n is large enough, the above estimate is clearly bounded by: 



< 2exp(pit ) 1 +pexp(pu ) / exp pu - c ™ — 
V Jo V wD Ui. 



du 



< 2exp(pu ) \l+pexp(p w D l ^, i / {Ac))wD 1 ^ 1 y' n / c 

Recalling that uq = D 2 w 2 /2 and using that l+exp(x 2 )x < C exp(Cx 2 ) for an appropriately 
chosen C > 1, we obtain that the latter quantity is at most: 



C exp{pD 2 w 2 /2 + CVVDf Tl ) , 



and the assertion follows. 



□ 



Corollary 3.7. In addition to the assumptions of Proposition 3.6, assume that the barycen- 
ter of \i is at the origin and that M3 = M%{u) < 00. Then setting: 



w :-- 



mm 



M 2 , 



1 



D 2 + 



(3.7) 
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the following estimate holds: 

4 \ V4 



If f^^\ d \ < Cmax (l,M 2 (i} 2 + ^ fl )) , 

yJlR" \ dfl n J J 

for any integer n greater than r\ = T j^ 2 ' Ms M ^ A,*!,^,*! ^ where C > is a universal 
constant. 

Proof. Recall that: 

d/J, E ,w( x ) = if— exp(-H(n E (x))) h WD r x M< w dx . 
Consequently, applying Proposition 3.6 with p = 4, we know that for any integer n greater 

D 1V D 2l i5r-,,w,4 

than T]g ■ 

[L (^f ) 4 ^) ' - ^MC'w\D 2 + Dl^)) . (3.8) 

Since Ze, w = 2wZe and Ze is the density at of Yli=i it is clear by the Central- 
Limit principle that when n is large enough, Ze should approximate the density at zero of a 
Gaussian random variable with zero mean and variance a 2 = - Y^t=i i- e - l/(v / 2vrcr)- 

It is possible to make this rigorous by invoking a local Central-Limit Theorem, as in the 
previous subsection; however, anticipating future situations where the Central-Limit The- 
orem is not available due to dependencies, we proceed as follows, even though this incurs 
a quadratic penalty in our final estimate. Applying the Cauchy-Schwartz inequality and 
using Proposition 3.6 again with p = —1, we obtain: 

Ze,w = / exp(H(x) - H(TT E (x))l^ D ^\< w dfj, n (x) 



> 



J Rn exp (-(H(x) - H(ir E (x))) l\n D (x)\<w dfJ, n (x) 



> {f, D {[-w,w]}Y 



C"eMC"w 2 (D 2 + Dl^)) ' 
We can now estimate [i D {[— w, w]} by invoking the Berry-Esseen Theorem 3.5, which yields: 

\/w > \(jl d {[-w,w]} -P(Z G [-w/a,w/a])\ < 



M 2 3/2 V" 



for some universal constant C > 0, where Z is a standard Gaussian random-variable on 
Consequently, when n is large enough, we obtain: 

Ze,w > cmm(l,w/a) 2 exp(-C"w 2 (D 2 + Df ^J) . 
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Now plugging this back into (3.8) and using w = wq, we obtain: 

(^Sf ) 4 ^ n ) ' - c(a/wo)2 ' 

as asserted. □ 
3.3 Log-Sobolev Transference from /x to /ig 

We now translate the above transference results from the concentration to the log-Sobolev 
level. 

Theorem 3.8. Let p = exp(— V(x))dx denote a probability measure on R with barycenter 
at so that D\^ 1 = D\^ 1 (p) < oo and D 2 ^ 1 = (fi) < oo for some 5 > 0. Assume in 
addition that (R, | ■ satisfies LSI(p), and that: 

- K := inf V"(x) > ~ . (3.9) 

Then for any integer n greater than n = T)*fg ^ 2 '■ Dl >*i'- D2 «*i ^ ^ e conservative zero-mean 
spin system (E, \ • \,(J,e) satisfies LSI with constant: 

P 

PLS(E, | ■ \,fiE) > Cqc ' 

where c, C > are universal constants and Q is the following scale-invariant quantity: 

Q := max(l, M 2 (D 2 + D^)) . 

Proof. First, note that the Herbst argument (see Appendix) implies the sub-Gaussian decay 
of Lipschitz functions on a space satisfying LSI, which in particular ensures the existence 
of all finite moments of Xq, and implies M 2 < C/p, for some universal constant C > 0. 

It is well-known (see Appendix) that the LS inequality tensorizes with respect to the 
Euclidean norm, and so (R n , | • \,p n ) satisfies a LS inequality with the same constant p. 
Since dp n (x) = exp(—H(x))dx with H(x) = J27=i V{ x i)i ^ follows that Hessif > —Kid as 
tensors in R n . The same bound holds for the restriction of these tensors onto any linear 
subspace, and so it follows that (E,\ ■ \,pe) satisfies our K-semi-convexity assumptions. 
Moreover, the uniform thickening of pe hi the direction D orthogonal to E only adds as 
an eigenvalue to the Hessian matrix in that direction, and hence (R n , | • \ , pe,w) also satisfies 
our K-semi-convexity assumptions for any w > 0. 

We now transfer the log-Sobolev inequality on (R n , | • \,p n ) onto (R n , | • \,Pe,w ) by 
applying Theorem 2.4 with p = 4. Note that (3.9) implies that p > (16/3)/c, which is 
required for applying Theorem 2.4, and consequently the parameter 6 in that theorem 

satisfies 6 > 1/3. Estimating f Rn ( f ' w ° J dp n using Corollary 3.7, it follows that for any 



, Af 2 ,M 3 /M 2 3/2 ,D 1 . lfl ,D 2 , s , 1 
integer n larger than rj s : 



PLsQ& n ,\ ■ \,f*E,wo) >C^ 
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for some universal constants c, C > 0. By the tensorization property of the LS inequality: 

PlsQR™, | • \,He,w ) = min(pLs(E, | • \,p E ),PLs(^, I ■ |, ^[-wq^q])) ' 
where ^[- W0:W0 ] denotes the uniform measure on [—wq,wq]. It follows that: 



Pls(E, | • \,He) > PLsQ& n , I • \ ,PE,w ) > c- c 



P_ 

Q c 



as asserted. □ 

Repeating the above argument and replacing Corollary 3.7 by Corollary 3.4, we obtain: 

Theorem 3.9. Let p = exp(— V(x))dx denote a probability measure on M. with barycenter 
at so that D\ ^ 1 = Di^ 1 (p) < oo and A := ||d/x/dx|| ioo < oo. Assume in addition that 
(R, | • \ ,p) satisfies LSI(p), and that: 

— K := inf V"(x) > —— . (3.10) 

Then for any integer n greater than n = rj M3 ^ M ^ ' MaA3 , the conservative zero-mean spin 
system (E, \ ■ \ ,pe) satisfies LSI with constant: 



Pls(E,\ -\,Pe)> c-^ 



where c,C>0 are universal constants and Q is the following scale-invariant quantity: 

Q := max(l,M 2 (K + J D^ 1 )) . 

Remark 3.10. We did not attempt to optimize over numeric constants above. In particular, 
the constant 8 in the conditions (3.9) and (3.10) may easily be improved down to 4 + e by 
using a p greater than 4 in Theorem 2.4 and Corollaries 3.7 and 3.4. Moreover, it should 
be possible to improve it all the way down to 1 + e (and similarly, the constant 4 in (2.8) 
should be pushed down to 1) by carefully revisiting Lemma 2.1, and replacing /Ci(r/2) in 
(2.2) by /Ci((l — £)r) for some arbitrarily small £ > 0. We refrain here from pushing these 
numeric constants to their limit, since this seems irrelevant for applications. 



4 Weakly Interacting Conservative Model 

In this section, we modify our non-interacting Hamiltonian H (x) = Y^t=x D Y adding 

some weak interaction term — Ia{x) corresponding to a weighted n by n symmetric matrix 
A = {aij} with zero diagonal (ay = a^j and m t i = 0): 

n 

H A {x) = H(x) - I A (x) , Ia(x) := a^XiXj . 

i,3=l 
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Define the corresponding Gibbs probability measure: 



fj,A ■= — exp(-HA(x))dx , 

Z A 

where Za > is a normalization term. To make sure that the Gibbs measure is indeed well 
defined (at least for weak enough interactions, see below), we will require throughout this 
section the following: 

Assumptions for Interaction: 

• p has barycenter at the origin: J xdp(x) = 0. 

• jj, has sub-Gaussian tail decay: 



Bp > VA e M / ex.p(Xx)dp(x) < exp(— ) . (4.1) 



Let X = (X\, . . . ,X n ) denote a random- vector in R n distributed according to pa, set 
S = Ya=i Xi, and let pa,e denote the law of X conditioned on S = 0, i.e.: 

VA,E = -77— exp (-Ha(x)) dvol E {x) . 

^A,E 

Again, fJ,A,E is not absolutely continuous with respect to fiA, so we define pa,e,w by "thick- 
ening" it uniformly in the diagonal direction by a width of w > from each side: 

dHA,E, w {x) = ey^{-H A {^E{x)))lw D ( x )\< w dx . 

2wZ a ,e 

To see that the Gibbs measure is indeed well defined for weak interactions, and to 
establish all of our estimates in this section, we require the following theorem, communicated 
to us by Rafal Latala, to whom we are indebted; its proof is deferred to the Appendix. 
Recall that we denote the operator and Hilbert-Schmidt norms of A by and 
respectively. 

Theorem 4.1 (Latala). Let X±, . . . , X n denote a sequence of independent random-variables 
on R, so that for each i = 1, . . . , n, the law pi of Xi satisfies the Assumptions for Interaction. 
Then there exist universal constants C2,C2 > so that for any integer n > 1 and n by n 
symmetric matrix A = {ofj} with zero diagonal: 



I < C 2 exp f -c 2 min f 9 % , j-^— J J Vi > 

v wwis \\ A \\o P 
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4.1 Concentration Transference from \i n to fj, E ,w 

The main new calculation in this section is given in the following: 
Proposition 4.2. Let p£R, and assume that: 

• The Assumptions for Interaction are satisfied. 

• For some appropriately chosen universal constants C^,C4 > 0: 

C 3 P 



OP ~ 1 + ClyfpW \P\ 



(4.2) 



Then: 



/ exp(j>I A (n E (x)))lw D f x )\< w dfj, n (x) < C 5 exp(|p| \\A\\ w 2 +C§p 2 {l+C Ay fpw) 2 \\A\\ 2 HS /p 2 ) . 

Proof. We may assume that p > 0, since otherwise we may replace A by —A. Furthermore, 
all assumptions and both sides of the desired inequality are invariant under the transfor- 
mations Xq h-> Xo/y/p, A/p I—)- A, Wy/p i—)- w and p i-> 1, and so we may assume that p = 1; 
nevertheless, we proceed in full generality. We evaluate: 



/ 
f 

j — i 



exp(pI A (tt e (x) ) ) 1 1 7TD ( x) | < w dp n (x) 

pexp(pu)p n {x E M. n ; |7Td(:c)| < w , Ia{^e(x)) > u} du 

o 

/•oo 

< e puo + / pe pu p n {x £ 1" ; |vtd(x)| < w , I A {^e{x)) > u} du , (4.3) 

J no 

for some u$ to be determined. We proceed by roughly evaluating the integrand as follows: 
p n {x G M n ; |vr D (2;)| < w , I a (tt e (x)) > u} 

n 

£R n ; \ir D (x)\ <w , ai,j{TT E (x))i{i: E (x))j > 



Pn < 



< Pn < 



el"; 3t G [-w,u>] ^ Oijfci + t/y/n)(xj + t/\/n) > 



< /i„, j: e 



tin . \ " 2w 

tj=i 



n 


w 2 


n 






+ 




> 









Since: 



hj =1 



< \\A\ 



up 
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setting uq = w 2 , we obtain: 



n 




E a i,j X i 


> U — Uq^ 


*J=1 





< P ( aijXiXj > a(u - u ) 1 + P ( ^= 



E a i,j X i 



> (l-a)(u-u ) (4.4) 



for some a £ (0, 1) to be determined. The first term above is immediately estimated using 
Theorem 4.1. Bounding the second term is elementary, since using (4.1) and independence: 

'a 2 A , .A / P t 2 



Vt>0 P Y^aiX, > t < inf exp — V a 2 - Xt) = exp 



Since: 



we obtain: 



n / n \ 

E ( 7^ E a «j = WMi, ■ • ■ > Vv^)i 2 < P| 



Vt > 



2w 

/re 



E(E°v) Xi 

i=i i=i 



> t\ < 2 exp 



Combining the estimates on both terms in (4.4), we conclude that: 

Vu > n , Va G (0, 1), p, n {x £ M. n ; |vr D (x)| < io , I a {^e{x)) > u] 

( . f a 2 p 2 (u-u ) 2 ap(s-s ) \\ ( p{l - a) 2 {u - u f 

< C 2 exp -c 2 mm — — 5 , — n — n + 2 exp - 



|A| 



HS 



\A\ 



op 



8w 2 \\A\ 



op 



Plugging this into (4.3), we obtain: 



/ exp(pI A (ir E (x)))l l7TD ( x)l < w dp, n (x) < exp(pu 
+ C2pexp(puo) J exp ^pu — c 2 min ^ 



a 2 p 2 u 2 apu 

iiTP ' nr 



/ (l-a) 2 /m 2 \ , 
+ 2pexp(j>u ) / exp pu — — -tj— du . 

\ 8w 2 A 



op 



Setting: 



a 



1 + T V / 8c 2 pw- l 
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we obtain: 



/•oo 

< exp(pn ) + (C 2 + 2)pexp(pu ) / exp 

Jo 



( \ 

c 2 p u z 

pu - 



du 



+ C 2 pexp(pu ) / exp \ pu — -r- 

7o ^ (i 



Since ||^4|| op < ||^4||#s> we see that as soon as (4.2) is satisfied with say C3 = C2/2 and 
C4 = \/8c2) we obtain: 

< e pu ° I 1 + (C 2 + 2)p /°° exp (pu - - — ) du + C 2 p f°° e~ pu du) 

V Jo \ {l + C 4 ^pwf\\A\\ 2 HS ) Jo ) 

< ^ fl + g2 + (C 2 + 2)pexpP (1 + yf ^U\\ HS ^C^ PW )\ 

Noting that l + xexp(x 2 ) < Cexp(Cx 2 ) for some appropriately chosen constant C > 1, and 
recalling that uo = ||A|| op ui 2 , the assertion follows. □ 

Proposition 4.3. Let the Assumptions for Interaction and the assumptions specified in 
Proposition 3.6 be satisfied. Assume further that for an appropriate universal constant 
c > 0: 

WMop < C P ■ ( 4 -5) 

Then setting: 



w 



min Mo 



• A 111111 I -L F J. J . o I * 

The following estimate holds: 

1/4 



, 77 • , , m 2 ,m 3 /m| /2 ,d 1 ,* 1 ,d 2i * 1 

for all integers n greater than n = T]? 



Proof. Given w > 0, we write: 

dfiA,E,w 



djl n 



dpn - 
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where: 



f(x) = exp(H(x) - H(ir E (x))) , g{x) = exp(I A (ir E (x))) , h(x) = l\-n D {x)\< w ■ 

Since we do not care about numerical constants (even inside exponents), we proceed by 
applying the Cauchy-Schwartz inequality several times, and obtain: 

dfJ-n < ( ( fh dp T \ ( f g 8 h dp r ^) ( f -^-h dp,^) ( f h dfi r 



V d t l n / \J J \J J \J 19 

< (Jfhdfin^ (Jt^hdy^j (Jf^hd^ (Jg- 2 hd^ (Jhd^\ (.4.6) 

The integrals above involving / and g were estimated in Propositions 3.6 and 4.2, re- 
spectively. And since f h d/j, n = pP {[—w,w]}, the Berry-Esseen Theorem 3.5 implies as in 
the proof of Corollary 3.7 that: 

> \n D {[-w,w}} - P(Z € [-w/a,w/a])\ < 



for some universal constant C > 0, where Z is a standard Gaussian random- variable on 
and a 2 = ^ Yl7=i ^•(Xf)- Consequently, when n is large enough, we obtain: 



J h dfi n > cmin(l, w/a) . 



Plugging all of these estimates into (4.6), we obtain for all integers n also larger than 
r]$ 2,4,1 from Proposition 3.6, and all interaction matrices A satisfying (4.2), that: 

4 \V4 

dn- I < f7maYh rr/nA 2 Pvn ( C i m 2 ( Bn -4- D? ... \ -4- m 2 II 41, 

I Op ; 



/ (^f^) 4(i/in ) " ^^(l^/^exp^^^ + ^J + ^PI 



exp^C(l + C 4 ^) 2 ^^j . 

Setting w = wq and noting that a 2 = Mi < C / p thanks to the sub-Gaussian decay as- 
sumption (4.1), it follows that ^/pwq < ^fpa < \[C. The smallness assumption (4.2) then 
translates into the assumption (4.5), and the assertion consequently readily follows. □ 

4.2 log-Sobolev Transference from fi to fi E 

We can now obtain: 

Theorem 4.4. Let p = exp(—V(x))dx denote a probability measure on K with barycenter at 
the origin, so that D\^ x = Di j vp 1 (/i) < oo and D2^ x = (p) < oo for some 5 > 0. Let 

A denote an n by n symmetric matrix with zero diagonal. Assume that (R, | ■ \,p) satisfies 
LSL(p), and that: 
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-k:=w£V"(x)>-%. (4.7) 

WMop < CP , (4.8) 
where c > is an appropriate universal constant. 

Then for any integer n greater than rj = n s 2 1 ' 1 , the weakly-interacting con- 
servative zero-mean spin system (E, | • | , pa,e) satisfies LSI with constant: 



Pls(E, | • \,Ha,e) > C-prp , 



P_ 

where c, C > are universal constants and Q is the following scale-invariant quantity: 
Q := max (l, M 2 (D 2 + D^J) exp(||A||| s / p 2 ) . 

Proof. The proof is almost identical to that of Theorem 3.8. 

Since p satisfies LSI(p), it follows by the Herbst argument (see Appendix) that the sub- 
Gaussian tail decay assumption (4.1) is satisfied (with the same constant p). In particular, it 
follows that M 2 < C/p, for some universal constant C > 0. It is well-known (see Appendix) 
that LSI tensorizes with respect to the Euclidean (£2) norm, and so (R n , | • |, p n ) also satisfies 
LSI(p). Since Hess-ff > —nld and Hessi^ = A < ||^4|| as tensors in M n , it follows that 
Hess-ff^ = Hess-ff — Hess/A > —kaM, where ka = k + \\A\\ . The same bound holds for 
the restriction of these tensors onto any linear subspace, and so it follows that (E, \ ■ \, pa,e) 
satisfies our K^-semi-convexity assumptions. Moreover, the uniform thickening of pa,e in 
the direction D orthogonal to E only adds as an eigenvalue to the Hessian matrix in that 
direction, and hence (M n ,| • \,Pa,e,w) also satisfies our KA-semi-convexity assumptions for 
any w > 0. Note that by ensuring that the constant c in (4.8) is not greater than 1/24, it 
follows from (4.8) and (4.7) that k a < § • 

We now transfer the log-Sobolev inequality on (W l , \ ■ |,/i n ) onto (W 1 , \ ■ \,pa,e,w ) by 
applying Theorem 2.4 with p = 4. To this end, we invoke Proposition 4.3 for estimating 

J]R n ( ^d/f W ° ) d[i n . Note that p > 6k a > (16/3) ka, which is required for applying The- 
orem 2.4, and consequently the parameter 6 in that Theorem satisfies 9 > 1/9. Also note 
that all of the assumptions for applying Proposition 4.3 are indeed satisfied, including the 
sub-Gaussian tail decay assumption (4.1) and the smallness condition (4.5) (by appropri- 

J dp n using Proposition 4.3, it follows 
that for any integer n larger than „^ 2 > M3 / M 2 .- D i,* 1 . £) 2,* 1 _ 



PLs(^ n ,\ ■ \ ,PA,E,w ) > C- c 



where c, C > are universal constants. By the tensorization property of the log-Sobolev 
inequality: 



p LS {R n ,\ ■ \,PA,E,w ) = min(pLs(E, I • \,PA,E),PLS(MA ■ I, V[-w ,w ])) 
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where v\-. WoV10 ] denotes the uniform measure on [— wo,wo]. It follows that: 

Pls(E, I • \,Ha,e) > PL.s(^ n , I • \,Pa,e,w ) > c-^ , 
as asserted. □ 

5 Uniform Bounds for Arbitrary Mean-Spins 

5.1 Conservative Spin Model 

We have thus far only treated canonical ensembles obtained from conditioning the grand 
canonical ensemble fi n on having the mean-spin S = ^^2Xi fixed at 0, the assumed barycen- 
ter of [i. Consider now the canonical ensemble \xe s obtained from conditioning the grand 
canonical ensemble fj, n on having a mean-spin S = s, namely: 

d^E s = — exp(-H(x)) dvol Es (x) , 

Z E S 

where E s denotes the affine hyperplane ^ Y17=i x i = s an d Ze b > is a normalization term. 
Naturally, we will only consider values of s which lie in isupp(/i), the interior of the support 
of [l. 

To handle arbitrary mean-spin values, we use the well-known Cramer trick, which is the 
key ingredient in the Cramer Theorem on Large Deviations, a central tool in the approaches 
of [19, 37, 36]. Denote by /i Aa the probability measure on K. obtained from [x by multiplying 
its density by exp(ax) and renormalizing: 

An 1 ^1-^ / \ 7 

Note that in our setting fi Aa is indeed well defined, since [i is always assumed to satisfy 
LSI, and hence (by the Herbst argument) it must have sub-Gaussian tail-decay, and so 
Z a = J exp(ax)dfi(x) < oo for all a G M. The key observation is that the densities of \x n 
and ([i Aa ) n coincide up to a constant multiple on each hyperplane E s , and consequently 
PE a = {p Aa )E s for all a,s G K. 

Let X Aa be a random variable distributed according to fi Aa . Given a 6 1, we denote 
s(a) := E,(X Aa ) (note that this expectation always exists in our setting). The function ]R 9 
a I—)- s(a) G isupp(//) is well known to be increasing and onto, and we denote its inverse by 
a(s). Denoting by T s the function T s (x) := x—s translating by s to the left, we denote [i a := 
(T s ( a >)*(/i Aa ), the translation of fi Aa having barycenter at 0. We denote by X a the random- 
variable with law fj, a . Finally, observe that the measure-metric space (E s , |-| , (p Aa ^)E B ) is 
isometrically isomorphic to the measure-metric space (Eq , | • | , (fi a ^ )e ), since the Euclidean 
structure is compatible with translations. We conclude that the best constants in a LSI 
on these spaces coincide, and so to obtain uniform estimates on pls(E s , |-| , (p ACj ^)e s ) in 
s G isupp(//), we must obtain uniform estimates on Pls(Eq,\-\ ,(/i a )£ ) in a G R. Using 
Theorem 3.8, we obtain: 
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Theorem 5.1. Let p = exp(— V(x)) dx denote a probability measure on R with V G C 2 (R) 
and lim x ^± OQ V(x) = oo. Fixing 5 > 0, denote p a := /9r,s(R, H , M a ), := M p (p a ), 
Dfy := Di i y 1 (p a ) and -D^i := ^2 *i (A 4 ")- Assume that: 

MS - 

p a >P>0, (M a)3/2 " M < °° ' ^l.*! " ^ < °° ' ^2,*! < ^2 < OO , 

uniformly in a G R, and i/ioi: 

-« := inf > . 

xGK 8 

T/ien /or n > Th^ Dl,L>2 , the canonical ensemble (E s , |-| , Pe s ) satisfies a LSI, uniformly in 
the system size n and mean-spin value s G R, depending solely on p, M , D\ and Di- 

Proof. By the preceding discussion, it is enough to verify a uniform lower bound on pls(Eq, \ ■ \ 
in a G R. Write p a = exp(— V a (x)) dx, and apply Theorem 3.8 to p a , which is possi- 
ble thanks to the requirement that —p a /8 < — k < V^f for all a G R. Note that always 
M% < C I p a < C/p by the sub-Gaussian tail decay (4.1) guaranteed by the Hebst argument 
(since E(X a ) = 0). The proof is complete. □ 

To demonstrate the desired uniformity for a concrete class of measures p, we recall the 
following definition given in the Introduction: 

Definition. A probability measure p = exp(— V(x)) dx is called (a,j3,uj) weakly Gaussian 
if we may decompose V = V conv + V per t so that: 

• Vconv, Vpert G C 2 (R). 

• V? onv > a > 0. 

• SUp Vpert — inf Vpert < U) < OO. 

• —k := -|aexp(-w) < V" < (3 < oo. 

Lemma 5.2. Let p be (a,/3,u) weakly Gaussian. Then for any a G R: 
1- P a ■= PLs(^,\-\,p a )>aexp(-u). 

2. p a = exp(-V a (x))dx with -p a /8 < V" < /3. 

3. For any 8 > and a G R, the parameters Mf / (Mf) 3 ' 2 , Df^ and associated 
to the measure p a are uniformly bounded above by functions of a, (3 andu. 

Proof. The first assertion is an immediate consequence of the Bakry-Emery condition for 
LSI (see Appendix) and the Holley-Stroock perturbation argument (1.3). The second asser- 
tion is immediate from the definition of a weakly Gaussian measure and the first assertion. 
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For the third assertion, ^ are trivially bounded above by sup^gjj |V^'(a:)| < D 2)OQ 
max(/3, k). As for D^ i , since V' a is Lipschitz (with constant D 2)00 ) and E(V^(X a )) 
J Va(x)dfi a (x) = 0, it follows by the Herbst argument (see Appendix) that: 



E(exp(A|V a / (X a )|)) < E(exp(Ay a / (A a ))) + E(exp(-Ay a '(A a ))) < 2exp 



2p a 



Consequently, choosing A > small enough (uniformly in a £ K thanks to the uniform 
bounds on D 2jOQ and p a ), we see that the right-hand side is bounded by e, implying the 
asserted uniform upper bound on ^ . 

Lastly, we use the fact that V a may be written as W\ — W 2 , where W\ is (strictly) 
convex, J exp(— W\(x))dx = 1, and supW-2 — inf W2 < u. Consequently, supW^ > and 
inf W% < 0. We estimate: 

M 3 (/i a ) J \x\ 3 exp(-Wi(x) + W 2 (x))dx 



M 2 (n a fl 2 (J x 2 exp(-Wi(x) + W 2 {x))dxf' 2 
< exp(sup W 2 mf W 2 ) 



2 / f T2 vnLW, I^WArA^/ 2 



(f x 2 exp(— W\{x))dx) 

The first term on the right hand side above is bounded by exp((3/2)w), and the second term 
is bounded by a universal constant thanks to Theorem A. 4 in the Appendix, which entails 
a reverse Holder inequality for moments of log-concave measures such as exp(— W\{x))dx. 
The proof is complete. □ 

Combining Lemma 5.2 with Theorem 5.1, we obtain: 

Theorem 5.3. Let p be a (a,f3,u) weakly Gaussian measure. Then the canonical ensemble 
(E s , \ -\ ,fiE s ) with mean-spin s satisfies a LSI, uniformly in the system size n > 2 and mean- 
spin value s£l, depending solely on a positive lower bound on a and upper bounds on (3 
and oj. 

Proof. The theorem follows immediately from Lemma 5.2 and Theorem 5.1 when n > rja ■ 
For smaller n, it is an easy consequence of the Bakry-Emery criterion (see Appendix) 
together with the Holley-Stroock perturbation argument (1.3): indeed, the strictly convex 
part V conv of the weakly- Gaussian potential implies that its restriction onto E s satisfies 
LSI (a), and the bounded perturbation V per t can only change the Hamiltonian by at most 
nuj, which is bounded when n < rja . □ 

5.2 Weakly Interacting Conservative Spin Model 

Analogously, let pa,e s denote the conditioning of the grand canonical weakly-interacting 
ensemble \ia to S = s, namely: 

dfiA,E s = — exp(-H A (x)) dvol Es (x) , 
Z A,E S 



37 



where Za,e s > is a normalization term. In fact, in applications, it is useful to consider 
our n-particle configuration (x\,... subset of a larger iV-particle configuration 

(xi, . . . ,x n , y n -\-i, . . ■ , Vn)-> with which it interacts via an additional term I^{x,y) in the 
Hamiltonian: 

n N 

H A,( y )(x) ■■= H(x) - I A (x) - In(x,v) , In{x,v) = ^2 /2 a hj x iVj ■ 

i=l j=n+l 

Setting hi = - Ylf= n +i a ijyj and b = (&i> • • • > &n)> we define: 

n 

H A)b (x) := H A) ( y )(x) = y*X v ( x i) + b i x i) - !a(x) , 

i=i 

and introduce the canonical weakly-interacting ensemble with mean-spin s and boundary 
contribution b = (b±, . . . , b n ): 

dHA,E a ,b = -77— — ex.p(-H Atb (x)) dvol Es ( x ) ■ 
A A,E s ,b 

As before, note that: 

(V AU )A,E s ,b = VA,E a ,b Vli £ 8 . (5.1) 
However, contrary to the non-interacting case, the commutation with the operation of 
translation is not as nice. Indeed, recall that we denote the translation function on R n by 
T (x) = x — t, and note that given t = (t\, . . . , t n ): 

dx dxi Z 

i=i 1 

Consequently, to apply Cramer's trick, we need a much more delicate argument than in the 
non-interacting case: 

Lemma 5.4. Let X a denote a random-variable distributed according to fi a , and assume that 
M2 := sup ag iR Var(X a ) < 00. Then for any n by n symmetric matrix with zero diagonal 
so that \\A\\ < 1/(2M2), and for any mean-spin s £ R and boundary contribution b = 
(61,..., b n ) € R n , there exists a tilt uq € R and translation vector (ti, . . . , t n ), so that 
J £i=l*< = s and so that the barycenter of (T* l )*(/U A(6l+ " 0+2E ^ 1 ai - jtj) ) is at for all 
i = 1, . . . , n. 

Proof. Let F(a) (previously denoted s(a)) denote the barycenter of /i Aa , and recall that this 
is an increasing function. Moreover, observe that F'(a) = Var(X a ) G (0, M-2[. We would 
like to show that given A, s and b as in the assumption, there exists a solution uq 6 R, 
t = (t\, . . . ,t n ) S R n to the following system of n + 1 non-linear equations: 

n 

Fi(u ,t) := F(zi(u ,t)) := F(b i + u + 2^2a i jt j ) = t { , i = 1, . . . , n , 

i=i 

n 

U = sn . 

i=i 
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Recall the definition of the hyperplane E s = {t G M. n ; X^f=i U = sn }- Given t G M n , we 
define uq = uo(t) to be the unique element of M so that 

G{t) := (F 1 (u (t),t),...,F n (u (t),t)) G E s . 

Since F is strictly increasing, it is immediate to verify that ito(t) is indeed well-defined. Our 
goal is to find a solution t G E s to the equation G(t) = t. We will show that under our 
assumptions, G is a strict contraction on E s , when the latter space is equipped with the 
induced Euclidean structure from (M n , |-|), and hence the existence of a (unique) solution 
will follow immediately from Banach's fixed point theorem. 

To show that G contracts Euclidean distance on E s , we calculate the Jacobian matrix 
dG/dt = {dG t /dtj} i j=1> n at t: 

Next, observe that since: 

n 

Y J F(z k (u (t),t)) = sn , 

k=l 

differentiating in tj reveals that at (uq (t) , t) : 

^° £ F'(z k ) + £ F'(z k )2a kJ = , j = 1, . . . , n . (5.4) 



k=l k=l 



Denoting F'{z) = (F'(zi), . . . ,F'(z n )) G W 1 and combining (5.4) with (5.3), we obtain in 
matrix form: 

f = ^W)(-p4^Row(F'(,)).2^ + 2^ 
= f Diag(f W) - ||JV( * f(z) 8 F'(z) j 2A , 

where Diag(-F'(z)) denotes the diagonal matrix with F'(z) as its diagonal, Row(i 7 "(2;)) 
denotes the n by n matrix having identical rows equal to F'(z), and we have used that 
F' > to dispense of the absolute values in H-F^z)^ . Note that as positive definite 
matrices: 

< Diag(F'(z)) < ||-F'(z)|L Id < M 2 Id , 

II II -too 

and that by Holder's inequality: 

- wm^ 8 F ' {z) - l w0t Id - l|F ' wll '» u - md ■ 
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and so consequently: 

-Mild <B := Diag(F'(z)) - 1 F'(z) ® F'(z) < M 2 Id . 

It follows from this, (5.5) and our assumption that < 1/(2M 2 ) that: 

HrfGMIlop (t) < 2 ||B|| op P|| op < 2M 2 ||A|| op < A < 1 , Vt G E s , 
and the desired contraction property is demonstrated, concluding the proof. □ 
Given u = (u±, . . . , u n ) S W 1 , let us denote: 

,,u ,o.n ,,Ui 

» ■= ®^ ' -&r '= zi Eo dvolEo(x) ' 

where recall p a denotes the translated p Aa with barycenter at the origin. An immediate 
consequence of (5.1), (5.2) and Lemma 5.4 is: 

Corollary 5.5. With the assumptions of lemma 5.4, for any mean-spin s£R and boundary 
contribution b = (b±, . . . ,b n ) S M. n , there exist tilts u = (u\, . . . ,u n ) E W 1 so that (E s , \ ■ 
\,fJ>A,E 3 ,b) is isometrically isomorphic as a measure-metric space to {Eq, \ ■ \,P>ae )- In 
particular: 

Although the results described in Sections 3 and 4 were proved for the case of identically 
distributed independent random variables X±, . . . ,X n , each having law p, all of the proofs 
carry through mutatis mutandis to the case of non-identically distributed independent r.v.'s. 
This is thanks to the general formulation of the central tools we have used - Theorems 3.1, 
3.5 and 4.1 - which did not assume identical distribution, only uniform upper bounds on 
the appropriate parameters. Consequently, all of the results of those sections carry through 
to the case when p n , pe and pa,e are replaced by p u , p$ Eq and p\ Eq , respectively, when 
the parameters: 

are replaced by: 

supM 2 ( M a ), sup *ff P 2l2 > sup GO , sup GO, mfpGO, 



respectively. Combining this with Corollary 5.5, we obtain: 

Theorem 5.6. let p = exp(— V(x)) dx be a probability measure on K. with V £ C 2 
and lim x ^± 00 V (x) = oo. Fixing 5 > 0, denote p a : = pLg(R, |-| , p a ), := M p (p a ) 
Df iSfi := Di^ 1 (p a ) and := D^^p" 1 ). Assume that: 

M a 

p a >P>0, {Ma 3 )3/2 < M < oo , < D 1 < oo , Dfa < D 2 < oo , 
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uniformly in a 6R, and that: 



— k := inf V"(x) > . 

Xei A denote an n by n symmetric matrix with zero diagonal satisfying: 

\\Mo P < CP » (5-5) 

for some appropriately chosen universal constant c > 0. Then for any n > r]^ Dl,D ' 2 , the 
canonical ensemble with weak-interaction (E s , |-| ,/j,ae s b) satisfies LSI, uniformly in the 
system size n, mean-spin value s£l and boundary contribution b £ W 1 , depending solely 
on p, M, D\, D2 and \\A\\ HS / p. 

Proof. Choosing the constant c > in (5.5) small enough, the sub-Gaussian tail decay (4.1) 
guaranteed by the Hebst argument (since K(X a ) = 0) ensures that: 

C 1 
\\A\\ op <cp<c—< 



M 2 2M 2 



Consequently, the assumptions of Corollary 5.5 are satisfied, and so it is enough to verify 
a uniform lower bound on pls{Eq, \-\ , p\ E ) in u G W 1 . We furthermore require that 
c > in (5.5) is smaller than what is required in (4.8) to apply Theorem 4.4. Write p a = 
exp(— V a (x))dx, and apply Theorem 4.4 to p u , which is possible thanks to the requirement 
that — p/8 < —k < V" for all o£l. Note again that M 2 < C/p by the sub-Gaussian tail 
decay (4.1). The proof is complete. □ 

Combining Lemma 5.2 with Theorem 5.6, we obtain: 

Theorem 5.7. Let p be a (a,f3,ca) weakly Gaussian measure, and let A denote an n by n 
symmetric matrix with zero diagonal satisfying: 

\\A\\ op ^ caexp(-w) , 

for an appropriate universal constant c > 0. Then the canonical ensemble (E s , \-\ ,pA,E a ,b) 
with weak-interaction A, satisfies a LSI, uniformly in the system size n > 2, mean-spin 
value s £ K. and boundary contribution b £ M n , depending solely on a positive lower bound 
on a and upper bounds on (3, uj and \\A\\ HS /(aexp(— a;)). 

Proof. The theorem follows from Lemma 5.2 and Theorem 5.6 when n > rja- For smaller 
n, it is an easy consequence of the Bakry-Emery criterion (see Appendix) together with 
the Holley-Stroock perturbation argument (1.3): indeed, note that ||^4|| op < cq, and so the 
strictly convex part V^onv of the weakly-Gaussian potential, is still strictly convex even after 
adding the linear boundary term and the weak interactions if we assume that c < 1, since 
aid — A > (1 — c)ald as positive definite matrices. Consequently, so is the restriction onto 
E s , and by the Bakry-Emery criterion, satisfies LSI((1 — c)a). The bounded perturbation 
Vp er t can only change the Hamiltonian by at most noj, which is bounded when n < rja^- Q 
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6 Spectral-Gap of Conservative Spin Model with Convex Po- 
tential 



We now turn to study the spectral-gap of a canonical ensemble, having a convex potential. 
The prime example we have in mind, which was suggested to us by Pietro Caputo, is the 
two-sided exponential measure v = ^ exp(— \x\) dx. More generally, recall that a measure 
p = exp(— V(x))dx on R n is called log-concave when V : M. n — > R U {00} is convex. Note 
that for a one-dimensional log-concave probability measure p on R, it is known (see Theorem 
A. 4 in the Appendix) that: 

^(R,|-U)-^y, (6-1) 

where recall we use A ~ B to denote that c\ < Aj B < C2 for some two universal constants 
ci,c 2 > 0. 



6.1 Zero Mean-Spin Case - Bounded System Size 

First, we take care of the zero mean-spin case when the system size n is bounded above: 

Proposition 6.1. Let p denote a log-concave probability measure on R with barycenter at 
0. Then for any integer n > 2: 

Cp SG (M,\ ■ \,p)> PSG (E,\- \,p E ) > -psg(M,\- M , 

n 

where C, c > are universal constants. 

The recent paper [2] gives a better estimate involving a factor log(n) -2 instead of n . 
However this improvement is not essential for our purpose. The proof of the proposition is 
based on the following: 

Lemma 6.2. Let p denote a log-concave probability measure on R with barycenter at 0. 
Let X = (X 1 , . . . ,X n ) denote the random vector in R n distributed according to the prod- 
uct measure fi n , and let Xe = (X^, . . . ,Xg) denote its conditioning on the hyperplane 
Y17=l ^ = 0; *- e - having law pe- Then: 

Var(X 1 ) ~ Var(A^) . 

Ln other words, the variance of the original measure p is universally equivalent to the vari- 
ance of the marginal of the conditioned measure p e ■ 

Proof. Write p = f(x)dx. As usual, we denote the law of Ya=i ^ ^ l^n = 9n(x)dx, 
and let p l E = h n {x)dx denote the law of X\. Since p n is log-concave, then so is pe, and by 
the Prekopa-Leindler Theorem (e.g. [16]), so are their marginals p® and p\. In addition, 
the barycenter of p% is at the origin (by linearity of the projection), and so is the barycenter 
of p\ (by symmetry: E(A^) = E{X l \ £™ =1 X i = 0) = 0). 



42 



It is known (see Theorem A. 4) that for a log-concave probability measure v = w{x)dx 
with barycenter at the origin: 

Var M * . (6.2) 

Consequently, we just need to show that /(0) ~ h n (0). Indeed: 



hn(0) 



e->0+ le 



Xl\ < e) 



lim — I 

£,<5-s-o+ 2e 



IX 1 ! < e 



i=i 



lim 

e,<5-s-o+ 2e 



/(0) lim 



1 PflX 1 ! < e A iEr=i^i < 6 ) 



\U=i^\<5) 



p ( 


1 "V« 
V ^T 2^=2 ^ 


< A-l 

— yfn-l) 


p ( 


1 v^ n Vi 


<-f) 



/(0) hm 
= /(0) 



£4j 

C = 2^|<^) 

^o+p(iEr=i^i<^) 

n £n-l(0) 



- 1 5n(0) 



and so it remains to show that g n _i(0) — ffn(0)- Since is log-concave itself, it follows 
from (6.2) that for any integer k > 1: 



5fc(0) 2 

thereby completing the proof. 



VarOif ) = Var f -L £ x*) = VarpC 1 ; 



8=1 



□ 



Proof of Proposition 6.1. Since is log-concave, it follows by the isoperimetric bound of 

Kannan-Lovasz-Simonovits [22] coupled with Cheeger's inequality [12] (see e.g. [38] for 
more information) that: 

Psg(E, I ■ |,/xe) > 



f\x\^ E (x) E(Er=i(^) 2 ) n\ a v(X E ) ' 

for some universal constant c > 0. Appealing to Lemma 6.2 and (6.1), we observe that 
Var(X E ) ~ Var(X x ) ~ \ • |, /-<)> thereby concluding the proof of the right-hand side 

inequality. 

Denoting as usual by Xe the random vector distributed according to he, and testing the 
spectral-gap inequality on (E, \ ■ \ ,(jL E ) with the function E B x = {x±, . . . , x n ) i->- f(x) = x±, 
we have by Lemma 6.2 and (6.1) that: 



PSg(E, I • \,fJ, E ) < 



mf(x E )\ 2 

Var(/(X S )) 



n — 1 



n Var(A^) Var(X r 



- Psg( 



as asserted. 



□ 



Remark 6.3. In [22], Kannan, Lovasz and Simonovits proposed a daring conjecture, now 
commonly referred to as the KLS conjecture. It predicts that for log-concave probability 
measures on any Euclidean space, the spectral-gap can be evaluated up to dimension free 
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constants by just testing the inequality on linear functions. In the case of the measure 
Pe, since the coordinates play symmetric roles, the covariance of pe is a multiple of the 
identity of E. Hence all (non-zero) linear functions give rise to the same Rayleigh quotient. 
Therefore if p is log-concave, and with the notation f(x) = x\ of the latter proof, the KLS 
conjecture predicts that: 

« E\Vf(X E )\ 2 n- 1 1 1 
PSG ^ 1 • ^ " Var (/ pr E )) = n Var(^) " Var^) " PSG ^ 1 " ^ ' 

6.2 Zero Mean-Spin Case - General System Size 

Theorem 6.4. Let p = exp(— V(x))dx denote a log-concave probability measure with 
barycenter at so that V € C 2 (]R) and D\^ 1 = Di^^p) < oo, and let p = p5c(R, | • 
Then for any integer n > 2: 

/0sg(£, I • I, Me) > c- 



(l + log(Q)) 2 ' 

where c > is a universal constant and Q is the following scale- invariant quantity: 



Q := max(l, y / Var(//)L'i i ^ 1 (p)) . (6.3) 

Proof. It is well-known (see the Appendix) that the spectral-gap inequality tensorizes with 
respect to the Euclidean (£2) norm, and so (R n , | • | , fj, n ) satisfies a spectral-gap inequality with 
the same constant p. Since dp n {x) = exp(—H(x))dx with H(x) = ^r=i^( 2 '*)' ^ follows 
that Hess-ff > as a tensor field in K n . The same bound holds for its restriction onto 
any linear subspace, and so it follows that (E, \ ■ \,pe) satisfies our convexity assumptions. 
Moreover, the uniform thickening of pe m the direction D orthogonal to E only adds as 
an eigenvalue to the Hessian matrix in that direction, and hence (W n , \ ■ \ , pe,w) also satisfies 
our convexity assumptions for any w > 0. 

We now transfer the spectral-gap inequality on (M n , | • \,p n ) onto (W 1 , \ ■ \,pe,w ) by 
applying Theorem 2.7 with p = 4. Note that necessarily A := \\dpj dx\\ Loo < 00 and that 
M^/M^ 2 and M3A 3 are bounded above by universal constants, thanks to Theorem A. 4. 
Applying Corollary 3.4 with k = 0, it follows that whenever n exceeds some universal 
constant 770, the following estimate holds: 

1/4 

< Cmax(l, y/HhD\&i) = CQ , 
where C > is some universal constant and: 





Theorem 2.7 therefore implies that: 



PSG^A ■ \,VE,w ) > C - 2 , 

(l + log(Q)) 
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for some universal constant c > 0. By the tensorization property of the spectral-gap in- 
equality: 

PsgO^M • \,VE,w ) =mm(p S G(E,\ ■ |, he), Psg0&, I ' \,V[-w ,w ])) > 
where v\- WOiWo \ denotes the uniform measure on [—wq,wq]. It follows that when n > t]q: 

PSg(E,\ ■ I He) > PSG^A ■ \,PE,w ) > C - P 2 , 

(l + log(Q)) 

as asserted. The case when n < rjo is handled by Proposition 6.1. This concludes the 
proof. □ 

Remark 6.5. Observe that the quantity Di i $ 1 (/x) may be infinite, e.g. for V(x) = e x +c, 
and so in particular the scale invariant quantity y / Var(/i)L'i jl i, 1 (/i) appearing in (6.3) is not 
bounded above in the class of log-concave probability measures. However, this quantity 
is always bounded away from 0: to see this, set D\^{h) := ll^'lli,2(^)> use 2exp(|t|) > t 2 , 
Cauchy-Schwartz and integrate by parts: 



oo 



^y/Vaiif^D^ifi) > v /Var(/x)D 1)2 ( M ) = ( / t 2 dn{t) \ {V') 2 dfi 

/•oo 

> / tV'(t)dfj,(t) = 



Lastly, we mention that even the scale invariant quantity ^/Var(//)Z>i i 2(/x) is not uniformly 
bounded above in the class of log-concave probability measures: for potentials of the form 
V(x) = \x\ p + Cp, it is equivalent (up to constants) to p 1 ^ 2 when p is large. 

Corollary 6.6. Let p = exp(— V(x)) dx denote a log-concave probability measure with 
bary center at 0. Assume that either: 

1. V is Lipschitz with constant L. 

2. V £ C 1 (M) and V is Lipschitz with constant L 2 . 
Let p = psg(^-> I • I j p)- Then for any integer n > 2: 

Cp > Psg{E, \-\,Pe)>c- 



'log(2 + L 2 /p) 2 ' 
where C, c > are universal constants. 

Proof. It is easy to verify that it suffices to prove the claim when V G C 2 (M). Indeed, any 
convex V as in the first (respectively, second) case may be approximated in the maximum 
norm by convex functions V m G C 2 (M) so that the || V^J| ioo (respectively, || V^|| ioo ) converges 
to at most L (respectively, L 2 ). Since the spectral-gap is stable under convergence of the 
potential in the maximum norm, the reduction follows. 
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Consequently, by Theorem 6.4, it is enough to bound Q given by (6.3) from above. First, 
observe that c\ < Var(/i)p < C2 according to Theorem A. 4. In the first case, we obviously 
have Di^^fi) = \\V'(X )\\^ i < \\V'(X )\\ LOO < L, and hence Q < max(l, ^/c^L/^/p). In 
the second case, we know by the result of M. Gromov and V. Milman [17] that spectral-gap 
implies exponential concentration of Lipschitz functions about their mean, and since V 
is L 2 -Lipschitz with E(V'(X )) = 0, it follows that D^^/j,) = \\V'{X )\\^ i < CL 2 /^/p, 
and hence Q < max(l, ^fc^CI? j p). In either case, the asserted lower bound follows from 
Theorem 6.4. The upper bound follows from Proposition 6.1. □ 



6.3 Dependence on Mean-Spin 

Given a log-concave probability measure p = exp(— V{x)) dx on R, we denote a± = 
\im x ^.± 00 (V (x) — V(0))/x (the latter function is monotone and hence the limits exist in the 
wide sense). Consequently, the probability measure p Aa is well defined for all a £ (a„,a + ), 
and as usual, we denote by X Aa the random variable distributed according to p Aa . Again, we 
denote s(a) := E(X Aa ), which always exists since p Aa is still a log-concave probability mea- 
sure, and hence has exponential tail-decay. The function (a_,a + ) 3o4 s(a) £ isupp(/x) is 
increasing and onto, and we denote its inverse by a(s). 

Applying the Cramer trick as in Section 5, we immediately see that: 

Psg(E Si I • |,MeJ = Psg(E , I • |,Me s) ) Vs £ isupp(/i) , (6.4) 

where recall [i a is the translation of \x Aa having barycenter at the origin. As a consequence 
of Corollary 6.6, we obtain: 

Theorem 6.7. Let fi = exp(— V(x)) dx denote a log-concave probability measure on M. 
Assume that either: 

1. V is Lipschitz with constant L. 

2. V £ C 1 (IR) and V is Lipschitz with constant L? . 

Given s £ M, denote p s := psgQ&i I ■ \,P a ^)- Then for any integer n > 2 and mean-spin 
s £ R: 

Cp s > Psg(E s ,\ -\,He 3 ) > c- — Ps 2 2 , (6.5) 

log(2 + L 2 /p s y 

where C, c > are universal constants. 

Proof. Write p a = exp(— V a (x))dx for a £ (a_,a + ). In the first case, since obviously 
|a_|,a+ < L, then V a is Lipschitz with constant 2L. In the second case, since by definition 
V a (x — s(a)) = V(x) — ax + c a , we see that the Lipschitz constant of V' a is identical to that 
of V . In either case, the assertion follows from (6.4) and Corollary 6.6. □ 

Remark 6.8. The left-hand bound in (6.5), namely CpsgQ^, I • \,P a ^) > PSg(E s , \ ■ \,pe s ), 
is true without any restriction on the log-concave measure p, as follows from Proposition 6.1 
and (6.4). The KLS conjecture (see Remark 6.3) predicts that the logarithmic term in the 
right-hand bound in (6.5) and the technical restrictions on the log-concave measure p may 
be removed. 
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We conclude this subsection with an estimate of the one-dimensional spectral-gaps ap- 
pearing in the previous results. It is optimal up to constants, as witnessed by the example 
of the two-sided exponential measure, studied in the next subsection. 

Proposition 6.9. Let f/, be a log-concave measure on M and let X be a random variable 
distributed according to fi. Then for all s G isupp(//): 

p SG ^,\-\^ a{s) )> 



Var(X) + s 2 ' 
where c> is a universal constant. 

Proof. Assume as we may that is the barycenter of [i. By (6.1), our goal is to prove that 
for all s G issup(/i): 

Var(X Afl W) < C(Var(X) + s 2 ) , 
for some universal constant C. This is equivalent to showing that for all a G (a_,a + ): 

Var(X Aa ) < C(Yar{X) + s(a) 2 ) = C(Var(X) + (EX Aa ) 2 ) . (6.6) 

It suffices to prove this for all a G [0, a + ), as we may apply it to fj, o (—Id). We claim that 
for all a G [0, a + ), the following holds: 

J \t\e ta dfi(t) < y / E{X 2 ) J e ta dfi(t) + J te ta dn(t) . (6.7) 



Indeed, this is true for a = since E|X| < ^E(X 2 ) and / tdfi(t) = EX = 0. Now set 
(p(a) = f te ta dfi(t), and note that <^(0) = and ip'(a) = f t 2 e ta dfi(t) > 0. Hence for all 
a G [0, a + ), tp(a) > 0. For a G [0, a + ), the derivative of the right-hand side of (6.7) is equal 
to: 



y/E(X2j<p(a) + J t 2 e ta dfi(t) . 

This no less than the derivative of the left-hand side of (6.7): J i 2 sign(i)e ta dp(t) . Hence we 
have proved (6.7), which can be rephrased as E\X Aa \ < yjE(X 2 ) + EX Aa . Using (a + b) 2 < 
2(a 2 + b 2 ) and E((X Aa ) 2 ) < C(E|X Aa |) 2 (see Theorem A.4), we obtain: 



E((X Aa ) 2 ) < C' (E(X 2 ) + (EX Aa ' 
This obviously implies (6.6) and consequently completes the proof. □ 
6.4 The Two-Sided Exponential Measure 

In the special case of the two-sided exponential measure v = ^ exp(— |x|) dx, we can indeed 
confirm the KLS conjecture for (E s , | • |, ve s )- Note that it is easy to check that if a G (—1,1) 
then Var(/i a ) ~ (1 + s(a) 2 ), and therefore pscQ^, I • |,^ a(s) ) - 1/(1 + s 2 ). 
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Theorem 6.10. Let v = \ exp(— |x|) dx. Then the canonical ensemble (E s , |-|, ve 3 ) satisfies: 

Psg(E s , I • \,VEs 



1 + s 2 ' 

uniformly in the system-size n > 2 and mean-spin s € R. 

Proof. The upper bound on psg{E s , | • |, z^ s ) follows e.g. from Remark 6.8. As for the lower 
bound, since V(x) = |x| + log(2) is 1-Lipschitz on R, Theorem 6.7 implies that: 

c 

PSGl^s, • > 7— — — — ——2 • 

((2 + | S |)log(2 + | S |)) 2 

Consequently, the asserted lower bound follows when \s\ < C2, for any fixed constant C2 > 0, 
and it remains to verify it (by symmetry) when s > C2. 

Observe that ve s has constant density on the simplex A ns , defined as: 

A n , s '■= E s n R™ , 

where Wf denotes the positive orthant. Denoting by Aa„ a the uniform measure on A ns , 
we conclude that: 

VE s \A n , s 
A "" " V Es {^n,s) ■ 

It is well-known (see e.g. [3]) that uniformly in n > 2: 

PSG(An,l, I • I, Aa„,J > c , 
where c > is a universal constant, and hence by scaling: 

PSG(A n , s , I • I, Aa„,J > -o • 

We will deduce the desired assertion by transferring this bound on the spectral-gap from 
Aa„ s onto the entire f£ a . Since both measures are log-concave on E s , we employ the trans- 
ference principle given by case 2 of Theorem 2.5, which requires control over ||dAA„ JdvE a \\ Lx 
We conclude that the remaining part of the assertion will follow if we show for instance that: 

u Es (A n>s ) > 1/2 V?i>2 Vs>C 2 >0. (6.8) 

This boils down to a simple calculation, which we now verify. 
Indeed, denote: 

A(n- l,r) := |x G R™- 1 ;^^^ < r| . 

Now write: 

v Ee {Es \ An,,) 3 h k^\A n _ hsn ex P ( " ££1 \xi\ ~ \sn - J27=i x i\)dx 



VE s (A n s ) J± J. exp(-sn)da 
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where J is an appropriate Jacobian factor. It is immediate to verify that Vol(A(m, 1)) 
1/m!, and hence by scaling: 



(n- 1)! 
(sn)"" 1 

(n-l)! 
(sn) 71 - 1 



71 l,sn 



exp sn 



n-l 



i=i 

n-l 

exp | —2 ( sn — x, 

i=i 



72-1 \ 

sn — Xj | J dx 
i=i J 

n-l \ 



Y, 2 i x i)- dx 



i=l 



where a_ := (|a| — a)/2 = max(— a, 0). Since: 

n— 1 f n— 1 

M™" 1 \ A n _i, sn C (J {xi <0}U <Y, Xi - sn 

i=i U=i 

it follows by the union bound that: 

v Ea (E s \A n<s ) < (n-l)! /•°° e - 2 ^ = "! 
^(A n , s ) " (sn)"" 1 7 2(sn)"- 1 " 

In particular, the latter ratio is bounded above by 1 (in fact, 1/2) whenever e.g. s > 1, 
yielding (6.8), as desired, thereby concluding the proof. □ 




6.5 Further Remarks 

As witnessed by Theorems 2.5, 2.7 and 2.8, there are many possibilities for transferring 
the spectral-gap estimate from the product measure \i n onto the thickened conditioned 
measure ^e,w We chose to present above the most convenient possibility for handling the 
two-sided exponential measure, which was to employ Theorem 2.7 coupled with the estimate 
already obtained in Proposition 3.3, leading to a dependence on the parameter Z)i ) «j 1 ( / u). 
However, several other possibilities, each having its own advantages and disadvantages, are 
possible. In particular, we mention that it is possible to naively estimate fi n | d ^' w — ^} 
by repeating the argument of Proposition 3.3 and simply using Chebyshev's inequality 
(instead of Bernstein's Theorem 3.1), resulting in a dependence on the parameter D\ t 2 = 
\\V\\l 2 (u) = J\\V"\\v-{p)> i ns t ea d of the more complicated Di^ 1 (fi) (cf. Remark 6.5). 
However, this does not lead to a bound on dxv ("m ^E,w), since Lemma 2.6 requires control 
over fi n {(l| 7ri3 (a;)|< lu d/%)/dA t B,w > t}, to which end one would also need to control D|(/x). 
And controlling V" from above is problematic since this prevents approximating non-smooth 
densities such as that of the two-sided exponential measure. 

Consequently, we also point out another method for directly handling non C 2 densities, 
by proposing an alternative to the second-order Taylor expansion of V(x + e) which was 
crucially used in the proof of Proposition 3.6. 

Definition. Let V : R — > R denote a locally Lipschitz function. Given e ^ 0, define: 
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• W £ (x) = V{x + e) - V{x). 

. W e {x) = l-exp(-^ W) j 

. W e {x) = exp(-W(g)-l + W(: 

It is immediate to verify that: 



e 2 



V(x + e) - V{x) = eW((x) + — W$(x) , 



/ 



and that: 

f Wf(x) exp(-V(x))dx = Ve / . 

It is also easy to check that if V is Lipschitz with constant L, then: 

V |e| G (0, l/L] VxeR \W[(x)\ < CL , \W£(x)\ < CL 2 , (6.9) 

for some universal constant C > 0. If V is only locally Lipschitz, then we have the less 
useful: 

V|e| G (0,1] Vx e R |Wf(x)| < L(x)exp(L(x)) , < 2L(x) 2 exp(L(x)) , (6.10) 

where L(x) := sup| £ | e(0jl] \ v ( x + £ )~ v ( x )\ , 

It is now immediate to verify that we may use the above decomposition instead of the 
second-order Taylor expansion in the proof of Proposition 3.6, with the constants Di^ 1 , 
Z?2 ^ and D2 replaced by the following ones (respectively): 

D™1:= sup \\Wf(X )\\ sup \\WI(X )\\ , D?> 5 := sup E(|Wf(X )|) , 

|e|G(0,<5] 1 |e|6(0,<J] 1 |e|e(0,<5] 

where as usual Xo is distributed with law /x = exp(— V(x))dx. Since we need our bounds 
to hold uniformly on |e| S (0, 5], and since the ones given by (6.10) are very bad, it is most 
convenient to consider the simplest case of a potential V having global Lipschitz constant 
L > 0, in which case (6.9) implies that: 

\fd < l/L < CL , £>J£ < CL 2 , Z)f' 5 < CL 2 . 

The rest of the proof of Proposition 3.6 remains unchanged. 



Appendix 

A.l Useful facts about log-Sobolev and spectral-gap inequalities 

Theorem A.l (Bakry-Emery [1]). Let (M,g) be a complete smooth oriented Riemannian 
manifold, equipped with its geodesic distance metric. Let p, = exp(— ip(x)) dvo\M(x) denote 
a probability measure on (M,g), with tp G C 2 (M), and assume that as tensor fields: 

Bp > Ric 9 + Hessgijj > pg . 

Then (M,g,p) satisfies LSI(p). 
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Theorem A. 2 (Herbst Argument (e.g. [28])). Assume that (Q,d,p) satisfies LSI(p). Then 
the following Laplace-functional inequality holds: 

J exp(Xf)dp < exp(A 2 /(2p)) VA > V 1-Lipschitz f s.t. j fdp = . 

Theorem A. 3 (Tensorization (e.g. [28])). Let (fij, dj, /ij), i = 1,2 6e Riemannian mani- 
folds, equipped with their geodesic distance and an absolutely continuous probability measure. 
Observe that in this case, the geodesic distance on the product manifols is given by the £2 
product metric 

V(xj,yi) G Hi x f2 2 di ® d 2 ((xi,yi), (x 2 ,y 2 )) := \AM:ri, x a) 2 + d 2 (yi,y2) 2 ■ 

Assume that (flj, dj, //j) satisfies LSI(pi), i = 1,2. TTiera i/ie product measure-metric space 
(f2i x f2 2 ,di <S> c?2 , ® M2) satisfies LSI(mm(pi, p 2 ))- Tfte same statement holds for the 
spectral-gap inequality, with LSI replaced by SG in all occurrences above. 

A. 2 Useful facts about log-concave measures 

Theorem A. 4. Lei /i = f{x)dx denote a log-concave probability measure on R. Then: 

1. M q (p)\ < C^M p (p)p for alll<p<q< 00. 

2. ci < psg(R, I • |,M)Var(/*) < c 2 . 

5. If the barycenter of p is at the origin then /(0) > H/H^oo /e. 
^. C! < H/ll^ Var( M ) <c 2 . 
i/ere C, ci , c 2 > are universal numeric constants. 

Proof. The first assertion is a well-known Kahane-Khintchine inequality, which entails a 
reverse Holder inequality for moments of linear (and more generally, homogeneous convex) 
functionals on the class of log-concave measures (see Berwald [4]), or deduce this from 
Borell's lemma [8] as in [45, Appendix III]). Note that the barycenter of p may not be at 
the origin, but this is not required for obtaining the asserted reverse Holder inequality. The 
second assertion is due to Bobkov [5]. The third assertion may be found in [15], and the 
fourth one may be found in [44] when / is even and in [23, Lemmas 2.5 and 2.6] or [47, 
Lemma 3.2 and (3.12)] in the general case. □ 

A. 3 Order Two Sub-Gaussian Chaos 

Let us now present the proof of Theorem 4.1, communicated to us by Rafal Latala. We 
greatly thank him for allowing us to include it here. It will be convenient to employ the 
following: 
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Definition. A real-valued random variable X is called a -sub- Gaussian (a > 0), if\\X\\ 2k < 
a ||G|| 2 fc for all integers k > 1, where G denotes a standard Gaussian random-variable, and 
\\Y\\ p := mY\P) l /P. 

An elementary calculation verifies that there is a universal constant C > with: 
E(exp(AA)) < exp(A 2 /(2p)) VAgR => X is C/^p-sub-Gaussian . 
Hence Theorem 4.1 follows from (and in fact is equivalent to) the following: 

Theorem A. 5 (Latala). Let a > 0, and let X±, . . . ,X n denote a sequence of independent 
random-variables on K, so that for each i = 1, . . . , n, E(Xj) = and Xj is a-sub-Gaussian. 
There exists a universal constant c > so that for any integer n > 1 and n by n symmetric 
matrix A = {a^j} with zero diagonal: 



> i < 2 exp 



-cmm 



« 4 II4II 2 'a 2 ||A|| 



V< > . 



For the proof, we require the following two intermediate steps: 

Lemma A. 6. Let X±, . . . , X n be as in Theorem A. 5 and let G±, . . . , G n denote independent 
standard Gaussian random-variables. Then for any (a\, . . . , a n ) S M n and p > 2: 



i=l 



< 4a 



i=i 



Proof. Let {A^}" =1 denote independent copies of {Aj}" =1 , and let {£j}" =1 denote an inde- 
pendent sequence of (symmetric) Bernoulli ±1 random variables. Then by the Contraction 
Principle (e.g. [31]): 



n 

Ys^Xi 


< 


n 

^a^-Xl) 




n 

^2aiEi{Xi - X'i) 


< 2 


n 

a-i^iXi 


i=l 


P 


i=l 


V 


i=i 


V 


i=l 



Since {£jXj}" =1 is a sequence of independent symmetric a-sub-Gaussian r.v.'s, it follows 
immediately by algebraic expansion and vanishing of the odd coefficients that for any integer 
k > 1: 



y^ a-iEiXi 



i=l 



< a 



2 k 



^aiG, 



i=l 



2k 



and so the assertion follows for p = 2k. When p > 2 is not an integer, we simply choose an 
integer k so that 2k — 2 < p < 2k, and evaluate: 



< 



y^ a-iEiXi 



< a 



y^ajGj 



< 



i=\ 



i=l 



2 k 



i=l 



2k 



'2k -1 
p — 1 



-a 



< 2a 



i=l 



i=l 



□ 
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Proposition A. 7. Let {Xi}™ =1 , {Gi}™ =1 and {oij}f,- = i be as in the Lemma and Theorem 
above. Then for any p > 2: 



<Ca 2 



X/ a ijGiGj 

i,j =1 



where C > is a universal constant 



Proof. Let {^}" =1 and {G^}™ =1 denote independent copies of {Xj}" =1 and {Gj}" =1 , respec- 
tively. Then: 



Z2 a i,j X i X 3 



< Ci 



i,j =1 



< iecW 



/] a i,jGiG'j 



< 16CiC 2 a 2 



y~] a ijGi,Gj 



where the first and third inequalities are decoupling inequalities due to de la Peha and 
Montgomery-Smith [13], and the middle one follows from separability and the Lemma. □ 

Proof of Theorem A. 5. By the above Proposition and the Hanson-Wright estimates on mo- 
ments of Gaussian Chaoses of order 2 [20], we have for any p > 2: 



< Co? 



ai,jGiG 3 



<C'a 2 (p\\A\\+^p\\A 



Op 



\HS) 



The asserted tail decay estimate now follows by a standard application of Chebyshev's 
inequality and optimization on p. 

Note that the Hanson- Wright estimates in [20] are in fact valid for any symmetric sub- 
Gaussian distribution, and yield a dependence on || \A\ || (where \A\ is the matrix with 
entries if A = {aij}). However, it is important for our purposes to apply this result 

to non-symmetric distributions, explaining the symmetrization procedure carried out above. 
Furthermore, a better reference for the dependence on ||^4|| op as opposed to || \A\ \\ above 



is e.g. Latala [25]. 



□ 
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